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REMARKS 

After entry of this amendment, claims 2-15, 23, 28, 31-33, and 50-65 are pending. The 
claims have been amended without prejudice or disclaimer to correct the dependency in light of 
the cancellation of claim 1 made in the Response dated August 13, 2009. Support is found inter 
alia in the original claims. No new matter has been added. 

Claim Rejections - 35 U.S.C. $ 112 

Claims 2-15, 23,28, 31-33 and 50-65 are rejected under 35 U.S.C. § 112, first paragraph, 
for allegedly lacking an enabling disclosure. 

The Examiner asserts that deregulating including overexpressing any of the genes 
encoding enzymes and/or proteins of the pantothenate biosynthetic pathway and overexpressing 
any of the genes encoding enzymes and/or proteins of the methylenetetrahydro folate (MTF) 
biosynthetic pathway will involve manipulating any of the genes encoding any and all enzymes 
and/or proteins associated with any of the entire metabolism pathways. To do so, the Examiner 
contends that one skilled artisan would need to have the knowledge of the pantothenate 
biosynthetic genes, the isoleucine-valine (ilv) biosynthetic genes, and the MTF biosynthetic 
genes. The Examiner, however, alleges that many of these genes have not been discovered. The 
Examiner further contends that one skilled artisan would also need to have the knowledge of 
genes regulating pantothenate kinase or manipulating the kinase activity. Asserting that mere 
knowledge of the biochemical or metabolic pathways leading to the production of a compound is 
not sufficient to overexpress any of the genes associated with the pathways, the Examiner 
concludes that undue experimentation would be required to search and screen for any 
deregulation of any enzymes and/or proteins of the above pathways and to determine whether 
such deregulation leads to an enhanced production of pantothenate. Applicants strongly disagree 
and traverse the rejection for the reasons already of record and for the following additional 
reasons. 

It is noted initially that the disclosure provided in the specification is presumptively 
enabling. The manner of making and using the claimed invention must be taken as in 
compliance with the first paragraph of 35 U.S.C. §112, unless there is objective evidence or 
scientifically based reasoning inconsistent with the specification. See In re Marzocchii and 
Horton, 169 U.S.P.Q. 367 (C.C.P.A. 1971). "It is the Patent Office's burden to present evidence 
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that there is some reason to dispute the enablement provided in the specification. Unsupported 
speculation or conjecture on that the invention 'might not work' will not support a rejection 
based on 35 U.S.C. §112, first paragraph." Id. Simply pointing to the absence of a working 
example provides neither objective evidence nor reasoning in support of the rejection, and 
accordingly, a prima facie case of non-enablement on this ground has not been made out. 
Moreover, there has never been a requirement that every species encompassed by a claim must 
be disclosed or exemplified in a working example. In re Angstadt, 537 F.2d 498 (CCPA 1976). 
Additionally, even though practicing the full scope of the claims might have required some 
amount of experimentation, if the experimental techniques are well-known in the art, the 
experimentation is routine and not undue. See Ex parte Kubin, 83 USPQ2d 1410 (B.P ,A.L 
2007), affd on other ground, 90 USPQ 2d 1417 (Fed. Cir. 2009). 

Here, the claims are directed to a process for the enhanced production of pantothenate 
comprising culturing a recombinant microorganism of certain genera having the characteristics 
as specified in the claims, i.e. a deregulated pantothenate biosynthetic pathway and a deregulated 
MTF biosynthetic pathway, under conditions such that pantothenate production is enhanced as 
compared to a corresponding wild-type microorganism. The claims further specify that the 
microorganism suitable for practicing the claimed process should be derived from the genus of 
Bacillus, Corynebacterium, Lactobacillus, Streptomyces, Salmonella, Escherichia, Klebsiella, 
Serratia, Proteus, or Saccharomyces. Additionally, the "deregulated MTF biosynthetic 
pathway" is further defined as being by way of overexpressing at least one of the gcv, serA, serC, 
serB, glyA, sul,fol, mtrA,pab,panB and purR genes derived from Bacillus, Corynebacterium, 
Lactobacillus, Lactococci, or Streptomyces. Thus, the claims are not unduly broad as alleged by 
the Examiner. Rather, the subject matter encompassed by the present claims are well defined 
and the specification, together with the knowledge of the art, provides sufficient guidance, 
including working examples, to make and use the claimed process without undue 
experimentation. 

It is noted initially that, as acknowledged by the Examiner, the specification provides 
working examples demonstrating how to make and use the claimed process by overexpressing 
several different genes in Bacillus, The working examples provided in the specification 
describe, in detail, how to generate Bacillus strains with a deregulated pantothenate biosynthetic 
pathway (Example I), how to isolate genes involving in the MTF biosynthetic pathway 
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(Examples III and IV) or regulating the MTF production (Example VI), how to construct an 
expression cassette to overexpress a gene in a microorganism (e.g. Examples III and IV), how to 
produce Bacillus strains with a deregulated MTF biosynthetic pathway (Examples III, IV, and 
VI-IX), and how to detect the effect of increased gene expression on pantothenate production 
(Examples I and III-V). The methodologies described in the specification are applicable not only 
to the genes exemplified in the working examples, but also any other genes that are identified as 
being involving in the pantothenate biosynthetic pathway and/or the MTF biosynthetic pathway. 
Moreover, the specification further provides guidance as to elements necessary to express genes 
in a microorganism such as promoter sequences (page 20, lines 1-27), selectable marker 
sequences (page 22, lines 10-24), and artificial ribosome binding sites (page 21, lines 2-31). 
Additionally, Applicants submit that constructing vectors for overexpressing genes in 
microorganisms such as those recited in the claims is routinely used by one skilled in the art. 
Thus, although some testing and screening would be required to identify a recombinant 
microorganism useful for practicing the claimed process, such testing and screening would not 
be extensive and is routine in nature, and thus, not undue. See Ex parte Jacfoon, 217 USPQ 804, 
807 (1982) (the test for "undue experimentation" is not merely quantitative, since a considerable 
amount of experimentation is permissible, if it is merely routine); see also Ex parte Kubin, 83 
USPQ2d 1410 (B.P.A.I. 2007), affd on other ground, 90 USPQ 2d 1417 (Fed. Cir. 2009) (even 
though practicing the full scope of the claims might have required some amount of 
experimentation, if the experimental techniques are well-known in the art, the experimentation is 
routine and not undue). 

Moreover, it is further noted that, as well known in the art, pantothenate is naturally 
synthesized in bacteria, fungi, and plants. Thus, the microorganisms recited in the claims as 
being suitable for practicing the claimed process are pantothenate-producing microorganisms. 
As such, these microorganisms will naturally possess a pantothenate biosynthetic pathway as 
well as genes required for the production of pantothenate. For the same reason, these 
microorganisms would also be expected to naturally possess a MTF biosynthetic pathway as well 
as genes involved in this pathway because, as described in the specification, the MTF 
biosynthetic pathway supplies a substrate for the production of pantothenate. As demonstrated in 
Figures 1 and 2 of the specification, the pantothenate biosynthetic pathway, the isoleucine-valine 
(ilv) biosynthetic pathway, and the MTF biosynthetic pathway, including enzymes and/or 
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proteins involved in each step leading to the production of pantothenate, are well known in the 
art. This is further evidenced by Sahm et al (Applied and Environmental Microbiology, 1999, 
65(5): 1973-1979, cited in IDS dated October 3, 2005; see Fig. 1 at page 1974) and Voet et al 
(Biochemistry, Second Edition, 1995, John Wiley & Sons, Inc., pp. 761-764, cited in the 
Response dated August 13, 2009; see particularly Figure 24-39 at page 763). Thus, the 
specification and the art provide the knowledge of the pantothenate biosynthetic genes, the 
isoleucine-valine (ilv) biosynthetic genes, and the MTF biosynthetic genes necessary for one 
skilled artisan to practice the claimed process. 

Additionally, Applicants also note that, contrary to the Examiner's assertion, many of the 
genes involved in the pantothenate biosynthetic pathway and the MTF biosynthetic pathway, 
including the genes recited in the claims for deregulating the MTF pathway, have been isolated 
from various organisms. For instance, the serA gene has been isolated from species such as E. 
coli 9 Bacillus, and Saccharomyces cerevisiae. Similarly, the glyA gene has been isolated from 
species such as Saccharomyces cerevisiae, E, colt, Salmonella typhimurium, and Streptomyces 
coelicolor. A Table providing examples of genes recited in the claims that have been isolated is 
attached herewith for the Examiner's reference. The articles or references cited in the enclosed 
Table can be made available for the Examiner's review upon request. As a person of ordinary 
skill in the art involved in the production of pantothenate would have an advanced degree (likely 
a Ph.D. degree) in the molecular biological science and several years of work experience, 
identifying functional homologues or equivalents from other microorganisms would be simply 
routine practice. For example, degenerated primers may be used to isolate homologues or 
equivalents of a known gene from the gene pool of another microorganism, which can then be 
tested for its enzymatic function in a complementation assay. Methodologies or techniques 
required for identifying and isolating homologues or equivalents of a known gene are well 
known in the art and within the knowledge and experience level of one of ordinary skill. 
Moreover, with the increasing availability of complete genome sequences of many other 
microorganisms, one skilled in the art would be able to easily identify and screen for equivalent 
genes suitable for practicing the claimed process. 1 With this regard, Applicants note that detailed 

1 According to the NCBI website, more than 1000 prokaryotic genomes are now completed and available in 

the Genome database, including the genera of microorganisms recited in the claims, i.e. Bacillus (106 entries), 
Coiynebacterium (25 entries), Lactobacillus (59 entries), Lactococci (4 entries), and Streptomyces (22 entries). See 
http://www.ncbi.nlm.nih.gov/genomes/MICROBES/niicrobialJaxtree.html. 
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guidance as to how to isolate genes from other microorganisms is provided throughout the 
specification, including working examples, as discussed above. See e.g., Specification at page 
17, lines 4-15, and Examples III and IV. Accordingly, only very extensive and very 
unpredictable experimentation would rise to the level of "undue" in this art, which is clearly not 
present in the instant application. 

The routine nature of practicing the claimed subject matter based on the detailed 
guidance provided in the specification and the knowledge of the art is further evidenced by, for 
example, Collet et al (FEBS Letters, 1997, 408: 281-284, copy attached; hereinafter "Collet") 
and Ho et al (Plant J., 1998, 16(4): 443-452, copy attached; hereinafter "Ho"). In Collet, a 
human serB gene was cloned by PCR amplification using primers derived from human EST 
sequences which were identified in a Blast search using the serB sequence from S. mansoni. The 
enzyme encoded by the isolated human serB gene shares 30% and 40% sequence identity with 
the enzymes from E. coli and S. mansoni, respectively, and was shown to have the phosphoserine 
phosphatase activity (Collet at page 283, left Col.). Similarly, in Ho, &n Arabidopsis serC gene 
was isolated from a cDNA library using the insert of an Arabidopsis EST clone showing 55-67% 
homology to bacterial and yeast serC protein as probe. The Arabidopsis serC protein shares 34- 
71% homology with bacterial, yeast and spinach serC proteins (Ho at page 444, right Col, 2 nd 
full paragraph) and was shown to have the phosphoserine aminotransferase activity (Ho at page 
446, right Col, 2 nd full paragraph). These results, although obtained from organisms different 
from those recited in the claims, demonstrate nevertheless that it is merely routine for one skilled 
in the art to identify and isolate a gene involved in the pantothenate biosynthetic pathway and/or 
the MTF biosynthetic pathway. 

Applicants additionally note that, in both Collet and Ho, sequence alignments between 
the serB proteins and the serC proteins, respectively, from various organisms were provided. 
For example, in Collet, the serB proteins from rat, S. mansoni, human, E. coli, H. influenzae, 
Methanococcus jannaschii and yeast were aligned, and conserved regions were identified (Collet 
at page 282, Figure 1). Likewise, in Ho, the serC proteins from Arabidopsis, spinach, Bacillus 
circulans, E, coli, rabbit, and yeast were aligned, and signature sequences for serC protein as 
well as conserved sequence motifs were identified (Ho at page 445, Figure 2). Similar disclosure 
and identification of conserved sequences were also available for other genes involved in the 
pantothenate biosynthetic pathway and/or the MTF biosynthetic pathway, for example, glyA 
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gene (see e.g., Vatcher et al., J. of Biological Chemistry, 1998, 273(11): 6066-6073, copy 
attached) and serA gene (see e.g., Achouri et al, Biochem. J., 1997, 323: 365-370, copy 
attached). Based on the abundant information available in the art as illustrated in the above 
references, one skilled artisan would be able to easily identify the consensus sequence(s) for the 
gene involved in the pantothenate biosynthetic pathway and/or the MTF biosynthetic pathway, 
including the gene recited in the claims for deregulating the MTF pathway, and to clone such a 
gene using the methodologies or techniques taught in the specification or known in the art. As 
demonstrated by Collet and Ho, the testing and screening required for such identification and 
isolation is simply routine, and not undue. 

In view of the detailed description, guidance and working examples provided in the 
specification, as well as the state and knowledge of the art and high level of skill, Applicants 
respectfully submit that the specification enables the full scope of the claims without undue 
experimentation as further evidenced by the aforementioned references. On these facts, a proper 
analysis of the relevant factors supports enablement. In re Wands, 858 F,2d 731, 737 (Fed. Cir. 
1988). For at least the above reasons and for the reasons already of record, reconsideration and 
withdrawal of the enablement rejection is respectfully requested, 

Moreover, Applicants note again that the initial burden is on the Patent Office to present 
objective evidence or scientifically based reasoning to dispute the presumptively enabling 
disclosure provided in the specification in light of the knowledge of the art. Simply pointing to 
the absence of a working example is neither objective evidence nor reasoning to support a 
nonenablement rejection. Unsupported speculation or conjecture on that the invention "might 
not work" will also not support such a rejection. Additionally, some amount of experimentation 
will not defeat enablement so long as it is routine and not undue. Since the Examiner has not 
provided objective evidence or scientifically based reasoning to dispute the enablement provided 
in the specification and the art, a prima facie case of non-enablement has not been established. 
For this additional reason, the nonenablement rejection should be withdrawn. 

Claim Rejections - Double Patenting 

Claims 2-15, 23, 28, 31-33 and 50-65 are rejected on the ground of nonstatutory 
obviousness-type double patenting over claims 1-17 of U.S. Pat. No. 7,291,789 and claims 1-34 
of U.S. Pat. No. 7,244,593. Claims 2-15, 23, 28, 31-33 and 50-65 are further provisionally 
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rejected for obviousness-type double patenting over claims 2-6, 1 1-26, 29-32 and 35-40 of co- 
pending Application No. 1 1/879,143. Applicants will consider filling a terminal disclaimer upon 
an indication that the claims are allowable. 

Power of Attorney 

A Transmittal of Power of Attorney with Change of Correspondence Address together 
with the executed Power Of Attorney To Prosecute Applications Before the USPTO and a 
Statement Under 37 CFR § 3.73(b) were filed in the present application on March 12, 2009. 
Applicants respectfully request that all pertinent U.S. Patent and Trademark Office records 
relating to the subject application be changed accordingly. 

CONCLUSION 

In view of the above remarks and further in view of the above amendments, Applicants 
respectfully request withdrawal of the rejection and allowance of the claims. If any outstanding 
issues remain, the Examiner is invited to telephone the undersigned at the number given below. 

This response is filed within the three-month period for response from the mailing of the 
Office Communication, to and including March 1, 2010. No fee is believed due. However, if a 
fee is due, please charge our Deposit Account No. 03-2775, under Order No. 1331 1-00036-US 
from which the undersigned is authorized to draw. 

Respectfully submitted, 

B y I <Mu-^W*t 

Hui-Ju Wu, Ph.D. 

Registration No.: 57,209 
CONNOLLY BOVE LODGE & HUTZ LLP 
1007 North Orange Street; P. O. Box 2207 
Wilmington, Delaware 19899-2207 
(302) 658-9141 
(302) 658-5614 (Fax) 
Attorney for Applicants 

#741506 
Attachments: 

1 . Table exemplifying genes recited in the claims that have been isolated. 

2. Collet et al, FEBS Letters, 1997, 408: 281-284. 

3. Ho et al, Plant J., 1998, 16(4): 443-452. 

4. Vatcher et al, J. of Biological Chemistry, 1998, 273(1 1): 6066-6073. 

5. Achouri et al, Biochem. J., 1997, 323: 365-370. 
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Attachment 1 : 



Gene 


Organism 


Reference(s) 


serA 


E. coli 


Tobey et a/., J. Biol. Chem., 1986, 261: 12179-12183 


H. influenzae 


Genbank accession No. L45106 


Yeast 


Genbank accession No. P40054 


Bacillus 


Genbank accession No. L47648 


Rat 


Achouri et al. 9 Biochem. J., 1997, 323: 365-370 


serB 


E. coli 


Genbank accession No. P06862 


Bacillus subtilis 


Genbank accession No. NP 391291 


H. influenzae 


Genbank accession No. L42023 


Human 


Collet etal, FEBS Letters, 1997, 408: 281-284 


serC 


H. influenzae 


Genbank accession No. P44336 


Arabidopsis 


Ho etal, Plant J., 1998, 16(4): 443-452 


Spinach 


Genbank accession No. D84061 


Bacillus circulans 


Genbank accession No. Z46432 ! 


gcv 


E. coli 


Okamura-Ikeda et al, Eur. J. Biochem., 1993, 216: 539- 

548 




E. coli 


Genbank accession No. AAC75604 


C. elegans 


Vatcher et al 9 J. Biol Chem., 1998, 278(1 1): 6066-6073 


Salmonella 
typhimurium 


Steiertef a/.,DNASeq., 1990, 1(2): 107-113 


Streptomyces 
coelicolor 


Smiths* a/., Mol. Gen. Genet., 1988,211(1): 129-137 


sul/sfi 


E. coli 


Jones et al, PNAS, 1985, 82: 6045-6049 
Kanemori et al, J. Bacterid., 1999, 181(12): 3674-3680 


fol 


E. coli 


Genbank accession No. NP 414590 


Mycobacteria 


Barrow et al., U.S. Pat. No. 6,229,001 


mtrA 


Methanobacterium 
thermoautotrophicum 


Stupperich^a/.,Eur. J. Biochem., 1993,217(1): 115- 

121 


Methanopyrus 
kandleri 


Harms et al, Eur. J. Biochem., 1997, 250: 783-788 


Methanosarcina 
barkeri 


Harms et al, Eur. J. Biochem., 1997, 250: 783-788 


pab 


Yeast 


Genbank accession No. P37254 


panB 


E. coli 


Jones et al, J. BacterioL, 1993, 175(7): 2125-2130 


purR 


E. coli 


He et al, J. BacterioL, 1992, 174(1): 130-136 
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Abstract We report the sequence of the cDNA encoding human 
L-3-phosphoserine phosphatase. The encoded polypeptide con- 
tains 225 residues and shows 30% sequence identity with the 
Escherichia coli enzyme. The human protein was expressed in a 
bacterial expression system and purified. Similar to known l-3- 
phosphoserine phosphatases, it catalyzed the Mg 2+ -dependent 
hydrolysis of L-phosphoserine and an exchange reaction between 
L-serine and L-phosphoserine. In addition we found that the 
enzyme was phosphorylated upon incubation with l- 
[ 32 P]phosphoserine, which indicates that the reaction mechanism 
proceeds via the formation of a phosphoryl-enzyme intermediate. 
The sensitivity of the phosphoryl-enzyme to alkali and to 
hydroxylamine suggests that an aspartyl- or a glutamyl- 
phosphate was formed. The nucleotide sequence of the cDNA 
described in this article has been deposited in the EMBL data 
base under accession number Y10275. 

© 1997 Federation of European Biochemical Societies. 



1. Introduction 

L-3-Phosphoserine phosphatase is the enzyme responsible 
for the third and last step in L-serine formation, This enzyme 
not only catalyzes the Mg 2+ -dependent hydrolysis of L-phos- 
phoserine but also an exchange reaction between L-serine and 
L-phosphoserine [1,2], suggesting that its reaction mechanism 
proceeds through the formation of a phosphoryl-enzyme in- 
termediate; the latter has, however, never been directly dem- 
onstrated. As L-3-phosphoserine phosphatase has been re- 
ported to be deficient in one case of L-serine deficiency [3], 
it was of interest to determine the sequence of the human 
enzyme, Databanks contain DNA sequences of several bacte- 
ria and of two eukaryotes (Saccharomyces cerevisiae and 
Schistosoma mansoni) that are homologous to SerB, the 
gene encoding Escherichia coli L-3-phosphoserine phosphatase 
[4]- 

In this paper we report the sequence of the human enzyme. 
We have also expressed an active protein and show that it 
forms a phosphoenzyme when incubated with its substrate. 



at 30°C in a final volume of 3 ml; it was purified as in [6], Radioactive 
compounds and Thermosequenase were from Amersham and Pwo 
polymerase from Boehringer, Chemicals were from Sigma or Merck. 
Clones R14208, N23530, N42133, W05752, H38879 and T82144 from 
the IMAGE consortium [7], were kindly provided by the UK HGMP 
Resource Centre. 

2.2. Amplification of cDNA 

DNA obtained from human cDNA libraries (from renal cell carci- 
noma line LE9211-RCC [8], from the urinary bladder transitional-cell 
carcinoma LB831-BLC, or from EBV-transformed lymphoblastoid 
cell line LG2-EBV [9]) was amplified with Pwo polymerase (a polym- 
erase with proofreading activity) using two primers chosen as de- 
scribed in Section 3. The first (GTGCATATGGTCTCCCACTCA- 
GAGCTG) had a start codon (underlined) inserted in a Ndel site 
and the second (ACGGATCCTCATCTGAAGTTG1TTGGAGQ 
corresponded to nucleotides 882-902 of the sequence that we report 
in the EMBL databank, flanked by a BamHl site. The PCR-amplified 
product obtained with the human kidney library was purified by 
agarose gel electrophoresis, inserted in the EcoRV site of pBluescript 
and sequenced, 

2.3. Expression and purification of recombinant human 
L-3-phosphoserine phosphatase 

The insert of the resulting plasmid was excised with Ndel and 
BamHl restriction enzymes and ligated in the expression vector 
pET3a [10]. BL21(DE3)pLysS cells were transformed with this plas- 
mid and were aerobically grown in M9 medium at 37°C until Am 
reached 0.5-0.6. The culture was then maintained on ice for 15 min, 
after which time isopropylthiogalactoside was added to a final con- 
centration of 0.4 mM. After 20 h of incubation at 37°C, bacterial 
extracts were prepared as described [11], the lysis buffer being supple- 
mented with 5 ug/ml of antipain and 5 Ug/ml of leupeptin. The extract 
was centrifuged for 40 min (20000Xg at 4°Q. The resulting super- 
natant (25 ml) was diluted 3-fold with buffer A (10 mM Tris-HCl (pH 
8.5), 1 mM dithiothreitol, 1 ^ig/ml leupeptin, 1 ug/ml antipain) and 
applied onto a DEAE-Sepharose column (1.6X10 cm). The column 
was washed with 100 ml of buffer A and protein was eluted with a 
NaCl gradient (0-400 mM in 250 ml of buffer A). L-3-Phosphoserine 
phosphatase came out at a Na + concentration of 250 mM. The active 
fractions were pooled and concentrated 4-fold by ultrafiltration in an 
Amicon cell (YM-10 membrane). Two milliliters of this preparation 
were further purified by gel filtration on Sephacryl S-200 (1.6X50 cm) 
in buffer B (20 mM Hepes (pH 7.5), 1 mM dithiothreitol, 100 mM 
KC1, 0.5 ug/ml leupeptin and 0.5 ug/ml antipain) at a flow rate of 0.25 
ml/min. 



2. Material and methods 

2.1. Materials 

L-[ 32 P]Phosphoserine was synthesized by incubating 1 mM L-serine 
with 10 uM 32 P-labeled inorganic pyrophosphate (0.10 mCi), 2 mM 
magnesium acetate, 1 mM dithiothreitol, 25 mM Tris-HCl (pH 7.1), 
and 0.03 U pyrophosphate: L-serine phosphotransferase [5] for 40 min 



* Corresponding author. UCL 7539, Avenue Hippocrate 75, 
B-1200 Brussels, Belgium. Fax: (32) 2-7647598. 
E-mail : vanschaftingen@bchm.ucl,ac.be 



2.4. Purification and sequencing of rat liver 
h-3-phosphoserine phosphatase 
L-3-Phosphoserine phosphatase was purified from rat liver by a 
previously described procedure [12], followed by a gel filtration on 
Sephacryl S-200 equilibrated with buffer B. The active fractions 
were submitted to SDS-PAGE. Staining with Coomassie Blue re- 
vealed the presence of about 10 bands with M r 20000-70000. The 
gel was sliced, and the polypeptides were extracted and subjected to a 
renaturation procedure [13]. Phosphoserine phosphatase activity was 
observed only in the fraction corresponding to a 25 000 M t polypep- 
tide. Tryptic peptides were obtained from this protein and sequenced 
as described [14]. 
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Fig. 1. Alignment of 1-3-phosphoserine phosphatase sequences from various species (ratu: Rattus norvegicus; schi: Schistosoma mansoni; homo: 
Homo sapiens; esch; Escherichia coli\ haem: Haemophilus influenzae; meth: Methanococcus jannaschii; sacc: Saccharomyces cerevisiae). The 
tryptic peptides are shown for the rat enzyme. Similar and conserved residues are shown in boxes. 



2.5. Enzyme assays 

In the course of its purification, L-3-phosphoserine phosphatase was 
assayed at 30°C by the release of Pi from unlabelled L-phosphoserine 
in an assay mixture (250 ul) containing 25 mM Mes (pH 6.5), 5 mM 
MgCl 2 , 1 mM dithiothreitol, 5 mM L-phosphoserine, 0.1 mg/ml bo- 
vine serum albumin and 1-10 mU L-3-phosphoserine phosphatase. 
Reactions were stopped by the addition of 250 ul of 10% trichloro- 
acetic acid (TCA) and Pi was measured [15]. One unit of enzyme is the 
amount that catalyzes the conversion of 1 umol of substrate per 
minute under these conditions. For kinetic studies, the enzyme was 
assayed by the release of [ 32 P]Pi from L-[ 32 P]phosphoserine [16] in the 
assay mixture described above. The exchange reaction was measured 
by the incorporation of L-[ 14 Qserine into L-phosphoserine in an assay 
mixture containing 25 mM Hepes (pH 7.5), 1 mM dithiothreitol, 
1 mM MgCl 2 , 1 mM L-phosphoserine, 0.5 mM L-serine and 20000 
cpm L-[ 14 C]serine in a final volume of 100 ul. The incubation was 
arrested by spotting 25 ul of the reaction mixture on Whatman DE- 



81 papers, which were washed in water, dried and counted for radio- 
activity in the presence of 5 ml of HiSafe 2 scintillant. 

2.6. Demonstration of the formation of the phosphoenzyme 

Unless otherwise indicated, about 0.15 U of recombinant L-3-phos- 
phoserine phosphatase was incubated at 0°C in a mixture containing 
50 mM Hepes (pH 7.5), 1 mM dithiothreitol, 1 jiM L-phosphoserine, 
60000 cpm of L-[ 32 P]phosphoserine, 5 mM MgCl 2 , 20 mM L-serine 
and 1 mg/ml bovine serum albumin in a volume of 100 ul. The re- 
action was stopped at the indicated times by the addition of 250 |il of 
5% TCA and filtration on a polyethersulfone membrane (Supor® 200, 
from Gelman), The membrane was washed with 10 ml of TCA 5% 
and counted for radioactivity with 5 ml of HiSafe 2 scintillant. 

2. 7. Other methods 

Protein was measured according to Bradford [17] with bovine gam- 
ma-globulin as a standard. Sequencing was performed using T7 Ther- 
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mo Sequenase (Amersham), fluorescent primers and the LI-COR au- 
tomated DNA sequencer 4000L. 

3. Results 

5.2. Identification and sequencing of human cDNAs 

The predicted amino acid sequence of the S. mansoni en- 
zyme, most likely the closest to that of the human enzyme, 
was used to perform a Blast search [18]. Several human ESTs 
(expressed-sequence tags) were identified, which corresponded 
to the 5' (IMAGE Consortium [LLNL] cDNA clones 
R14208; N23530; N42133; W05752) or the 3' end (clones 
H38879 and T82144) of the parasite enzyme's open reading 
frame. Oligonucleotide primers derived from these sequences 
were used in PCR reactions to amplify human cDNAs derived 
from bladder or kidney tumor cells, or of lymphocytes. In all 
cases one single fragment of 725 bp was obtained. The frag- 
ment obtained by amplification of the kidney cell cDNA was 
cloned in pBluescript and sequenced. Several of the above- 
mentioned clones were also sequenced. The composite se- 
quence of the reconstituted human cDNA is reported in the 
EMBL databank. Note that clones N42133 and W05752 
lacked nucleotides 281 to 757, and that in clone N23530, there 
was a 122 bp insert homologous to the Alu I family between 
nucleotides 280 and 758. 

The sequence totals 1600 bp; the 5' and 3' non-coding 
regions amount to 187 and 735 bp, respectively. The ATG 
codon opens a reading frame of 675 bp, encoding a protein 
of 225 amino acids. This protein shows 30% and 40% se- 
quence identity with the enzymes from E. coli and S. mansoni, 
respectively. It shows also a high degree of identity with sev- 
eral peptides derived from L-3-phosphoserine phosphatase pu- 
rified from rat liver (Fig. 1). 

3.2. Expression and characterization of a recombinant protein 
To check that the sequence encoded L-3-phosphoserine 

phosphatase, the open reading frame was inserted in pET3a 
[10] and expressed in K coll Induction of the cells with iso- 
propylthiogalactoside led to the appearance of a 25000 M T 
polypeptide which after 20 h represented about 5% of the 
soluble proteins. L-3-Phosphoserine phosphatase was purified 
by chromatography on DEAE-Sepharose and Sephacryl S- 
200, from which it was eluted with an apparent molecular 
mass of 50 000 Mr, indicating a dimeric structure (not shown). 
The purified protein had a specific activity of 6 U/mg protein, 
the yield of the purification being 25%. 

The purified protein was dependent for its activity on the 
presence of Mg 2+ , which half-maximally stimulated the en- 
zyme at 5 mM. It displayed a K m of 20 uM for L-phospho- 
serine in the presence of 1 mM Mg 2+ , was non-competitively 
inhibited by L-serine {K\ = 0.5 mM) and catalyzed an exchange 
reaction (incorporation of L-[ u C]serine into L-phosphoserine) 
at a rate about 50% that of the hydrolysis (3 }imol/min/mg 
protein). These properties are similar to those of other l-3- 
phosphoserine phosphatases [1,2,19]. Other phosphate esters 
(glucose 6-phosphate, fructose 6-phosphate, glycerol 2- or 3- 
phosphate and 3-phosphoglycerate, all tested at 5 mM) were 
not significantly hydrolyzed by the purified preparation. 

3.3. Demonstration of the formation of a phosphoenzyme 
Upon incubation of human L-3-phosphoserine phosphatase 

with L-[ 32 P]phosphoserine, a phosphoenzyme was formed. 




Time (sec) 

Fig. 2. Time-course of the labelling of the rat liver enzyme with 
L-[ 32 P]phosphoserine, L-3-Phosphoserine phosphatase (0.15 U) was 
incubated at 0°C in the presence of radiolabeled L-phosphoserine 
with (o) or without (•) 10 mM L-serine. 



This could be demonstrated using a method in which protein 
was precipitated in acid and immediately isolated by nitration. 
Other methods in which niters are washed for prolonged time 
(30-60 min) [20,21] yielded insignificant incorporation, pre- 
sumably because of the lability of the phosphoenzyme (see 
below). As shown in Fig. 2, the formation of the phosphoen- 
zyme was greatly favored by L-serine which acted most likely 
by slowing down the degradation of L-phosphoserine. In the 
presence of 10 mM L-serine, the K m for the formation of this 
phosphoenzyme was 5 llM and a maximum of 2,5 nmol of 
phosphate was incorporated per milligram of protein, indicat- 
ing a stoichiometry of 0.06 mole per mole of enzyme subunit 
(not shown). SDS-PAGE [22] under denaturing conditions (at 
0°C, to minimize hydrolysis) confirmed that the phosphate 
was only incorporated into the 25000 M T polypeptide (not 
shown). The phosphoenzyme was acid labile, being 50% hy- 
drolyzed after 50 min of incubation in 5% TCA at 50°C. It 
was extremely labile in alkali, since it was completely lost 
when resuspended in 1 M NaOH at 0°C and reprecipitated 
in TCA, It was also completely hydrolyzed when resuspended 
in 0.2 M NH 2 OH at pH 5.3 (not shown) and maintained for 
10 min at 20°C. 

4. Discussion 

We report here the identification of clones encoding human 
L-3-phosphoserine phosphatase. This identification rests on 
the similarity of the encoded protein with the E. coli enzyme 
as well as with tryptic peptides derived from the purified rat 
liver enzyme. The identity was confirmed by expression of the 
cDNA and demonstration that the encoded protein behaved 
as a specific L-3-phosphoserine phosphatase with properties 
similar to those of the enzyme present in mammalian tissues. 

The fact that L-3-phosphoserine phosphatase catalyzes an 
exchange reaction between L-serine and L-phosphoserine and 
that it is non-competitively inhibited by L-serine suggested 
that the mechanism of this enzyme involved the formation 
of a phosphoenzyme. This is also consistent with its sensitivity 
to vanadate [23,24]. We show in this work that, indeed, a 
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phosphorylated enzyme is formed upon incubation with the 
substrate. 

Many phosphotransferases proceed through the formation 
of a phosphoenzyme. Four different types of residues have 
been shown to be implicated in covalently binding phosphate 
in these enzymes: serine, as in alkaline phosphatase [25]; his- 
tidine, as in acid phosphatase [26], fructose 2,6-bisphosphatase 
[27] and phosphogiycerate mutase [28]; cysteine in tyrosine 
phosphatases [29]; and aspartate in ATPases of the Na + /K+ 
ATPases family [30,31]. Histidine and cysteine are most likely 
not involved since there is no conserved cysteine or histidine 
in L-3-phosphoserine phosphatase (see Fig. 1). Furthermore 
the lability of the phosphoenzyme to alkali and to NH 2 OH 
indicates an acyl-phosphate linkage, as in phosphoaspartate 
or phosphoglutamate. The low stoichiometry of phosphoryla- 
tion and the lability of the phosphoenzyme bond prevented us 
from isolating a tryptic phosphopeptide suitable for sequenc- 
ing. The identity of the phosphorylated residue remains there- 
fore to be determined. 

Two highly conserved motifs containing aspartate residues 
are found in L-3-phosphoserine phosphatases: DXDST and 
GDGXXD. The first one shares two residues (DXXXT) 
with the consensus phosphorylation site of ATPases of the 
Na + /K + ATPase family. The second motif is also found in 
the same family of ATPases, where it is highly conserved (e.g., 
residues 713 to 718 of the alpha subunit of human Na + /K + 
ATPase), These considerations indicate that there may be 
functional homology between phosphoserine phosphatases 
and ATPases. 
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Summary 

Serine biosynthesis in plants proceeds by two pathways; 
a photorespiratory pathway which is associated with 
photorespiration and a pathway from phosphoglycerate, A 
cDIMA encoding plastidic phosphoserine aminotransferase 
(PSAT) which catalyzes the formation of phosphoserine 
from phosphohydroxypyruvate has been isolated from 
Arabidopsis thaliana. Genomic DNA blot analysis indicated 
that this enzyme is most probably encoded by a single 
gene and is mapped on the lower arm of chromosome 4. 
The deduced protein contains an N-terminal extension 
exhibiting the general features of a plastidic transit 
peptide, which was confirmed by subcellular organelle 
localization using GFP (flreen flourescence protein). 
Northern analysis indicated preferential expression of 
PSAT in roots of light-grown plants, supporting the idea 
that the phosphorylated pathway may play an important 
role in supplying the serine requirement of plants in 
non-green tissues. In situ hybridization analysis of PSAT 
revealed that the gene is generally expressed in all types 
of cells with a significantly higher amount in the meristem 
tissue of root tips. 

Introduction 

Serine is a key intermediate in a number of important 
metabolic pathways including the photorespiratory 
metabolism of glycolate to phosphoglycerate (Walton and 
Woolhouse, 1986). It is also Important as a precursor 
biomolecule in the generation of glycine, tryptophane, 
cysteine and in the interconversion of homocysteine and 
methionine (Ireland and Hiltz, 1995; Walton and Wool- 
house, 1986). Besides these functions, serine is involved 
in the synthesis of phospholipids, porphyrins, purines and 
thymidine, and is the source of one-carbon units. 

In the yeast Saccharomyces cerevisiae, serine and 
glycine are mainly synthesized by two pathways. On 
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fermentable carbon sources, serine is generated via the 
glycolytic pathway from phosphoglycerate, whereas on 
non-fermentable carbon sources, the other pathway, 
namely the gluconeogenic pathway (Melcher etal., 1995) 
starting from glyoxylate, is utilized. In animals and 
bacteria, serine is mainly synthesized via the phos- 
phorylated pathway {Ichihara and Greenberg, 1957) utiliz- 
ing phosphoglycerate, which is derived from glycolysis or 
from the oxidative or reductive pentose phosphate 
pathway. 

The situation is more complex in photosynthetic plant 
tissues since biosynthesis of serine can proceed via the 
photorespiratory pathway (Tolbert, 1980) (Figure 1). 
Glycolate is the substrate utilized during photo- 
respiration (Hatch, 1976). Its production in the chloroplasts 
initiates the photorespiratory pathway where glycolate 
is oxidized to glyoxylate from which glycine is generated. 
In the mitochondria, serine is generated by reactions 
involving two molecules of glycine; one molecule is 
oxidatively decarboxylated and deaminated to produce an 
active hydroxymethyl group that becomes attached to the 
p-carbon of the second molecule. The glycine decar- 
boxylase multi-enzyme complex (GDC), along with the 
enzyme serine hydroxymethyltransferase (SHMT), is 
responsible for the respiratory conversion of glycine to 
serine (Neuberger etal., 1986; Srinivasan and Oliver, 1995). 
The cDNAs encoding for the four different component 
enzymes of GDC {Kim and Oliver, 1990; Macherel etal., 
1990; Turner etal., 1992b, 1992c) and SHMT (Turner etal, 
1992a, 1993) from plants have been cloned and charac- 
terized. Most of the serine produced is returned to Calvin 
cycle via hydroxy pyruvate and glycerate. 

In the absence of photorespiration, serine could be 
synthesized from phosphoglycerate by the non-phos- 
phorylated pathway, involving glycerate and hydroxy- 
pyruvate as intermediates or via the series of reactions 
catalyzed by phosphoglycerate dehydrogenase, phospho- 
serine aminotransferase (PSAT) and phosphoserine 
phosphatase (Bryan, 1988). In animal cells, the non-phos- 
phorylated pathway is postulated to participate in serine 
catabolism rather than serine biosynthesis (Snell, 1986). 
However, the conversion of serine to phosphoglycerate is 
readily reversible in plants (Tolbert, 1980), Serine is pro- 
bably formed by this pathway in germinated pea cotyle- 
dons which lacked phosphoserine aminotransferase 
activities (Walton and Woolhouse, 1986). 

In the phosphorylated pathway, the glycolytic inter- 
mediate, phosphoglycerate, is oxidized by phosphoglyce- 
rate dehydrogenase to form phosphohydroxypyruvate, 
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which in turn is transaminased by phosphoserine amino- 
transferase to yield phosphoserine. In the final step, 
dephosphorylation of phosphoserine is catalyzed by phos- 
phoserine phosphatase (Ichihara and Greenberg, 1957; 
Stolz and Dornemann, 1994). It has been proposed that 
the latter pathway is probably of minor significance com- 
pared to the photorespiratory pathway during daylight 
hours where photorespiration takes place, but may be of 
more importance in the dark or early in leaf development 
(Ireland and Hiltz, 1995), However, there was evidence 
for a phosphorylated pathway operating in green tissues 
during photosynthesis (Ireland and Hiltz, 1995; Keys, 1980). 

The existence of metabolites that can be formed by two 
or more pathways raises interesting questions concerning 
their regulation and expression. Since various biosynthetic 
enzymes are distributed among different subcellular organ- 
elles in plant cells, it is possible that significant regulation 
of the metabolism of serine may control the rate and extent 
of the transport of intermediates and products of the 
pathways between various metabolic sites (Bryan, 1988). 

In our previous paper (Saito etal, 1997), we reported 
the isolation of a cDNA clone encoding PSAT from spinach 
and the preliminary characterization of the cDNA. In the 
present paper, we describe for the first time cDNA and 
genomic cloning, biochemical characterization and expres- 
sion as well as the immunolocalization and In situ hybridiza- 
tion of PSAT in Arabidopsis tha liana. 

Results 



clones selected for further studies, CPSAT-5 was shown to 
contain the largest cDNA Insert (1.7 kb). 

Sequence analysis revealed the presence of an open 
reading frame which is 1290 nucleotides in length. The 
deduced 430 amino acids encode PSAT and have a 
molecular weight of 47 359 Da. The first ATG triplet is 
designated as the translation start site because the 
surrounding sequence (AATC ATG GC) agrees well with the 
favoured sequence (AACA ATG GC) flanking the consensus 
functional plant-initiator codon (Lutcke etal., 1987). 
Furthermore, an in-frame stop codon is found 51 nucleo- 
tides upstream of the translation start methionine in the 
cDNA sequence. Primer extension experiments confirmed 
the above assumption (see below). The cDNA sequence 
shows the presence of a 3'-untranslated region of 189 
nucleotides downstream of the translation stop codon. The 
polyadenylation signal is most likely the AATAAT motif 
located 105 nucleotides downstream from the translation 
stop codon. 

The deduced amino acid sequence shares 34-71% 
homology with bacterial, yeast and spinach PSATs 
(Figure 2a). Both the signature sequence of PSAT proteins 
(Belhumeur et a/., 1994) and the conserved sequence motif 
involving the pyridoxal-phosphate binding lysine site found 
in Class V aminotransferase (Ouzounis and Sander, 1993) 
are conserved in Arabidopsis PSAT. A phylogenetic tree 
(Figure 2b) indicates that PSATs from Arabidopsis and 
spinach form a family distinct from bacterial and animal 
PSATs. 



Isolation and sequence of cDNA encoding PSAT of 
Arabidopsis 

The Arabidopsis ESTs clone FAFL52, which shows 55-67% 
homology to bacterial and yeast PSAT proteins, was used 
to screen 2.5 x 10 5 plaques from a whole plant Arabidopsis 
cDNA library constructed in )*gt11. Among the four positive 



Genomic cloning and sequence analysis of PSAT 

On screening approximately 2 x 10 5 plaques of the 
Arabidopsis genomic library with CPSAT-5, 2 positive 
clones, GPSAT-28 and GPSAT-45, were selected for further 
analysis. The 5 kb fragment, which covers part of the PSAT 
structural gene and the upstream region, was isolated from 
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Figure 2. Comparison of deduced amino acid sequences of PSAT from Arabidopsis and other organisms. 

YEAST, PSAT from Saccharomyces cerevisiae (Belhumeur era/., 1994); HAEIN, Haemophilus influenza {Fleischmann era/., 1995); SALGL, Salmonella 
gallinarum (Griffin, 1990); ECOLI, Escherichia coii (Duncan and Coggins, 1986); YERCN, Yersinia enterocolitica (O'Gaora era/., 1989); RABBIT, Oryctolagus 
cuniculus (Misrahi ef al„ 1987); SPINACH, Spinacia oleracea (Saito er a/., 1997); BACCIR, Bacillus circulans (Battchikova et a/., 1996); ARABIDOPSIS, Arabidopsis 
thaliana (the present study). 

(a) Multiple alignment of deduced amino acid sequences of PSAT from Arabidopsis and other organisms. Underlined letters indicate the conserved sequence 
motif involving the pyridoxal-phosphate binding lysine site found in Class-V aminotransferase {Ouzounis and Sander, 1993). Double-underlined letters 
indicate the signature sequence for PSAT proteins (Belhumeur era/., 1994). A dashed line indicates the putative transit peptide used for construction of GFP 
fusion protein. 

(b) Phylogenetic tree of PSATs using the phylogenetic program package. Phylogenetic tree, constructed using phylip 3.57c program by J. Felsenstein, indicates 
that PSATs from Arabidopsis and spinach form a distinct family from bacterial, yeast and animal PSATs. 



clone GPSAT-45 and sequenced, whilst the 1.1 kb fragment 
which covers the rest of the structural gene and the 3' 
region was isolated from GPSAT-28. Comparison of this 
sequence with the cDNA sequence of CPSAT-5 revealed 
that no intron is present in the coding region of PSAT 
gene. The transcriptional start site was mapped by using 
primer extension. A single transcriptional start site is 
located at 171 bp before the translation start site. An AT- 
rich sequence is located at -12 to -22 and a potential CAAT 
sequence at -94 to -97. 

Southern blot analysis was used to estimate the 
number of genes encoding PSAT in Arabidopsis. Arabi- 
dopsis genomic DNA digested with six restriction endo- 
nucleases was fractionated on an agarose gel. The result, 
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as shown in Figure 3, revealed that a single major fragment 
hybridized upon digestion with fcoRV, Sad and Xbal 
However, upon digestion by SamHI, Bgi\\ and fcoRI, several 
bands were observed. The appearance of the strongly 
hybridizing bands is due to the presence of several restric- 
tion sites for the endonucleases in the genomic sequence, 
whereas the weak signals observed are most probably due 
to the other isoforms of PSAT, or are maybe just due to 
the presence of pseudogenes. These results suggested that 
Arabidopsis contains a single gene corresponding to the 
isolated cDNA and a few related sequences homologous 
to the cDNA in the genome. Using the 10.5 kb fragment 
from GPSAT-45 as probe for analysis, the Arabidopsis 
PSAT gene was mapped to the lower part of chromosome 4. 
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Biochemical characterization of recombinant PSAT 

The 60 amino acid leader sequence exhibits the general 
features of a transit peptide for transportation of protein 
to the plastid. It starts with methionine-alanine; is rich 
in the hydroxylated amino acids, serine and threonine 
(16/60); has the small hydrophobic amino acids, alanine 
and valine (12/60); is essentially deficient in the acidic 
amino acids, aspartic acid and glutamic acid (1/60); and has 
a net positive charge (pi = 10.7). The prediction from the 
PSORT program (GenomeNet service, Osaka University) 
was that it is localized in the stroma of the chloroplast. 

The fusion protein of CPSAT-GFP (green fluorescence 
rjrotein) could be detected in intact tissues after delivering 
the constructs into Arabidopsis leaves by particle gun 
bombardment. The expressed signals in both constructs 
containing the predicted transit peptides of PSAT from 
Arabidopsis (Figure 4b) and spinach (Saito era/., 1997) 
(data not shown) were similar to those exhibited by transit 
peptide of the ribulose-1,5-bisphosphate carboxylase small 
subunit polypeptide of Arabidopsis (Krebbers era/., 1988) 
(Figure 4a) which was already known to be sufficient for 
translocation into chloroplasts (Chiu et al, 1996). The bright 
green fluorescence was clearly visible despite the red 
chlorophyll autofluorecence from chloroplasts under 
blue-light. 

These results confirmed that the N-terminal sequence 




Figure 3. Southern blot analysis of genomic DNA. 
Genomic DNA was extracted from the leaves of 3-week-old seedlings. 
For every lane, about 5 j.ig genomic DNA was digested with restriction 
enzyme, separated by eiectrophoresis through a 0.8% (w/v) agarose gel, 
transferred to a Hybond N + membrane (Amersham), and then hybridized 
with 32 P-labelled CPSAT-5. The final washing was performed in 0.1 x SSPE, 
0.1% SDS at 65X for 10 min. 



of Arabidopsis and spinach PSATs are sufficient for trans- 
localization of passenger protein into chloroplasts. Thus, 
it is likely that the PSATs are plastidic proteins. 

The identity of the isolated cDNA, CPSAT-5, was 
confirmed by successful complementation of a serine- 
auxotrophic mutant lacking the PSAT locus SerC. The 
mutant E. coli KL282 {tonA22, phoA4{Arr\) (serSH, serCW), 
serS13, ompF627, supD32 {ser(JI32) f relA1, pit10, spoTl, 
T 2 R) (Low et al, 1971) was transformed with an expression 
plasmid, pPSAT-AB13TS, in which the expression of PSAT 
cDNA is regulated by the lacZ promoter. Transformants 
could grow in the M9 minimal medium in the absence of 
serine, whereas the pTV118N transformed E. coli KL282 
was not able to grow without supplementation with serine 
(Figure 5), confirming the authenticity of the CPSAT-5 
encoding the functional PSAT. 

The recombinant PSAT was overexpressed in £ coli 
AD494 using a pET32a(+) vector system with a strong T7 
promoter. The His-tagged PSAT protein was purified by 
using a His-Bind R purification column (Novagen). Enzyme 
activity was not detected in the purified protein which was 
in the insoluble and inactive form. Expression of PSAT in 
a soluble form was too low to be detected by using 
SDS-PAGE, however it exhibited PSAT activity of 
0.72 ± 0.15|imoles min*" 1 mg of crude protein"* 1 . Specific 
activity of PSAT overexpressed in E. coli BL21, using the 
pET3d vector system, without His-tag was also investi- 
gated. The protein in the crude extract exhibited higher 
activity (1.54 ±0.01 |imoles min" 1 mg of crude protein" 1 ) 
than the overproduced protein in the E. coli AD494/ 




M9 plate + Serine M9 plate 




Figures. Functional complementation of set€~ B. coli KL282 (Low era/., 
1971) by transformation with the expression vector pPSAT-AB13TS carrying 
CPSAT-5. 

pTV118N is the empty vector used as a negative control, The transformed 
bacteria were cultured on M9 minimal agar plates with 1.0 mg ml" 1 serine 
(left plate) or without serine (right plate) supplementation at 30°C. 
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Figure 4. Organelle sublocalization of PSAT using GFP as reporter protein, 
Particle gun bombardment was carried out using Helios Gene-Gun System, 
at pressure 100 psi to transform 3-week-old Arabidopsis seedlings. A 
228-bp {dashed line in Figure 2a) PCR-amplified fragment of PSAT from 
Arabidopsis was subcloned into plasmid CaMV35S-sGFP(S65T)-NOS3\ 
The 35Sft-TP-sGFP(S65T) construct, carrying the transit peptide sequence 
obtained from the ribulose-1,5-bisphosphate carboxylase small subunit 
polypeptide of Arabidopsis {Krebbers era/., 1988) was used as a positive 
control, (a) Fluorescence signal pattern exhibited by 35Sft-TP-sGFP{65T), 
which was known to be targeted to chloropiasts (Chiu et ah, 1996). 
(b) Fluorescence signal pattern exhibited by CPSAT-TP of Arabidopsis (this 
study}. Bar = 100 \irr\. 




Figure 8. In situ hybridization of Arabidopsis with PSAT antisense probe, 
Blue color is toluidine blue staining, Longitudinal sections of root tip (a) 
and root (b,c) under dark field optics. (a,b) Signals - bright spots indicated 
by arrowheads, (c) Negative control, being hybridized with PSAT sense 
probe. Bars = 200 ^m; m, meristem; ep, epidermis; vb, vascular bundle, 



pET32a(+ ) system. The recombinant protein was detected 
as a 41 kDa protein by SDS-PAGE (Figure 6). The PSAT 
expressed was truncated to remove approximately 60 
amino acid residues of putative transit peptide from 
the full-length deduced open reading frame. Yet it was 
catalytically active and, in fact, resulted in enhanced protein 
accumulation in E. coli crude extract compared to the full- 
length deduced open reading frame. 

We determined K m values for glutamate and phospho- 
hydroxypyruvate using the overexpressed crude proteins 
of PSAT from Arabidopsis and spinach in E coli BL21/ 
pET3d system. Double reciprocal plots of the initial rates 
data demonstrated K m values for the Arabidopsis PSAT of 
70 um and 5 mM for glutamate and phosphohydroxy- 
pyruvate, respectively, and for the spinach PSAT of 150 u.M 
and 3 mM for glutamate and phosphohydroxypyruvate, 
respectively. In the range of 5-50 mM, serine, threonine, 
valine, glycine, tryptophane and O-acetyl-L-serine had no 
effect on the rate of reaction. Enzyme inhibition was 
observed with a high concentration of cysteine. 

The PSAT activity of recombinant enzyme of Arabidopsis 
was inhibited by the antiserum raised against the His- 
Bind R purified recombinant protein overexpressed in the 
E. coli AD494/pET32a(+) system. Inhibition was linear with 
the amount of antibodies added to the reaction mixture 
(data not shown). 



Gene expression of PSAT 

Northern blot analysis was carried out to examine the level 
of mRNA expression in shoot and root tissues from light- 
grown and dark-treated plants (Figure 7). The highest level 
of PSAT mRNA expression was observed in the light-grown 
roots followed by a significant amount of mRNA expression 
in light-grown shoots. A lower level of expression was also 
detected in dark-grown shoots and roots. The preferential 
expression of PSAT mRNA in the roots was contrasted with 
mRNAs of two enzymes involved in the photorespiratory 
pathway, the H-subunit of GDC (Srinivasan and Oliver, 
1995) and serine hydroxymethyltransferase (SHMT) 
(Turner et ah, 1992a), which accumulated primarily in the 
light-grown shoot tissues, although a less pronounced 
signal was also detected in the dark-grown shoots. 




Figure 10. Immunolocalization of PSAT protein 
in Arabidopsis {a-c are cross-sections of stem). 

(a) Negative control, incubated with pre- 
immune serum, in which only autofiuorescence 
signals of xylem were observed, 

(b) Incubated with rabbit antiserum raised 
against CPSAT, green fluorescence signals 
(indicated by arrowheads) were detected in the 
stele, preferentially associated with xylem. 

(c) A non-fluorescent micrograph of stem, ed, 
endodermis; xy, xylem. Bar = 100 um, 
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Figure 6. Overexpression of CPSAT-5 in pET3d and determination of enzyme 
activity of PSAT in the crude extract of E. coli BL21. 
Lanes: M, standard proteins as molecular weight marker (Pharmacia); 
BL21/pET3d, soluble fraction of E. coli BL21 carrying an empty vector 
pET3d; BL21/pPSAT-AB23PS, soluble fraction of E. coli BL21 carrying 
plasmid pPSAT-AB23PS with CPSAT-5 in sense orientation to the promoter; 
BL21/pPSAT-AB23PA, soluble fraction of E. coli carrying plasmid pPSAT- 
AB23PA with CPSAT-5 in antisense orientation. For the enzyme assay, the 
activities of PSAT were determined in soluble protein extracts using 
the method described previously (Duncan and Coggins, 1986). Data are 
expressed as mean ± SD [n = 3) umoles min -1 mg of crude protein -1 . 
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Figure 7. Northern analysis of RNA from different tissues of Arabidopsis 
seedlings. 

Ten ng total RNA was separated under denaturing conditions in a 1.2% 
agarose gel containing formaldehyde and transferred to a Hybond N + 
membrane (Amersham) and then probed with 32 P-labelled cDNA clone. The 
final washing was performed in 0.1 X SSPE, 0.1% SDS at 65°C for 10 min. 
(a) The mRNA of PSAT was accumulated in both leaf and root tissues, with 
the highest amount preferentially expressed in root tissues from light- 
grown plants. 

(b,c) Preferential expression of H-protein, a subunit of GDC, and SHMT 
mRNA in leaf tissues of light-grown plants although a much lower amount 
of expression was also detected in dark-treated leaf tissues. 



In situ hybridization results suggested that mRNA of 
PSAT was expressed at some level in all types of cells in 
leaf, stem and root tissues (data not shown). However, a 
strong signal was detected in meristem tissue of root 
tips (Figure 8a), indicating a significantly higher level of 
expression in fast proliferating tissues. The root tissues 
surrounding the vascular bundle also seemed to be sites 
of preferential expression (Figure 8b). 



Protein accumulation and immunolocalization of PSAT 

The antibodies raised against the overexpressed PSAT 
in the pET32a(+) system recognized the overexpressed 
protein in pET3d as well as the associated protein in 
Arabidopsis. Western blot results (Figure 9) showed that 
PSAT protein was undetectable in dark-treated seedlings. 
Immunolocalization experiments (Figure 10) indicated that 
PSAT protein was preferentially accumulated in the stele, 
in particular the cells close to the xylem in leaf, stem and 
root sections, 



Discussion 

Structural and functional properties of the PSAT 
molecule 

Our previous paper (Saito era/., 1997) described the 
preliminary characterization of PSAT by cDNA cloning 
from spinach. The present study is the first extensive 
molecular characterization of the serine biosynthetic 
pathway from phosphoglycerate in plants. 

The nucleotide sequence corresponding to the Arabi- 
dopsis PSAT mRNA encodes a precursor of mature protein 
of 430 amino acid residues. The spinach and Arabidopsis 
PSAT enzymes were engineered so that the protein 
expressed in E, coli had approximately 60 amino acid 
residues truncated from the full-length deduced open 
reading frame. These enzymes were catalytically active. 
Therefore, the N-terminal extension of these cDNAs may 
function as a transit peptide. Essential common characters 
of a chloroplast transit peptide suggests that the PSAT is 
located in the plastid/chloroplast. The gene product shows 
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Figure 9. Western blot analysis of PSAT. 

Protein extracted from 3-week-old Arabidopsis was separated by SDS- 
PAGE and eiectrochemically transferred onto Immobilon P membrane. It 
was then reacted with 1:500 dilution of rabbit antiserum raised against 
purified recombinant PSAT fused with His-tag. The immuno-reactive 
proteins were visualized using phosphatase-labelled goat antirabbit and 
NBT and BCIP as substrates. PSAT protein was preferentially expressed in 
both leaves and roots of Arabidopsis light-grown seedlings. No significant 
signal was detected for the extracts from dark-treated seedlings. 



high homology to PSAT amino acid sequences from other 
sources. Both the signature sequence for PSAT proteins 
and the common sequence motif involving the pyridoxal 
binding lysine found in Class V aminotransferase are 
conserved in Arabidopsis PSAT The gene product function- 
ally complemented an £ coli serC mutant strain. Enzyme 
activity associated with PSAT in £ co//crude extract, which 
overexpressed the cDNA, further confirmed its identity. 

Serine failed to inhibit PSAT from both Arabidopsis and 
spinach. Our findings agree with observations by Reynolds 
etaL (1988) using pea PSAT but were in contrast to the 
report of Larsson and Albertsson (1979) that PSAT from 
spinach was strongly inhibited by serine. The antibodies 
against the recombinant PSAT could inhibit the activity of 
the overexpressed PSAT £ coli crude extract. 

Physiological significance of gene expression and 
accumulation of PSAT protein 

The co-existence of two or more biosynthetic pathways in 
higher plant suggests that either of two pathways may be 
preferentially utilized depending upon the physiological 
state of plants at a given time. It is conceivable that serine 
is derived from glycolate via glycine during photo- 
synthesis. In the absence of photorespiration, in par- 
ticular, in rapidly proliferating tissues of low photosynthetic 
activity (Cheung era/., 1968; Reynolds and Blevins, 1986), 
the phosphorylated pathway may play the primary role in 
the total cellular supply of serine. This is probably because 



the photorespiratory pathway does not efficiently operate 
in the dark {Keys, 1980). Therefore, photosynthetic tissues, 
e.g. leaves of light-grown plants, may have a greater 
capacity for utilizing glycine as a serine precursor; whereas 
in non-photosynthetic tissues, e.g. roots, dark-treated 
leaves and etiolated leaves, etc., the phosphorylated path- 
way may take over to be the main supply of serine in 
plants (Ireland and Hiltz, 1995; Reynolds etaL, 1988; Saito 
era/., 1997). 

Under normal conditions, only minor PSAT activities 
were measured in the leaves of spinach (Larsson and 
Albertsson, 1979) and pea (Walton and Woolhouse, 1986), 
while in tissues associated with rapid cell proliferation, e.g. 
seed leaves and apical meristems of pea (Cheung era/., 
1968) as well as root tissues of soybean and lupin (Reynolds 
and Blevins, 1986), considerable amounts of PSAT were 
found. Stolz and Dornemann (1994) also detected PSAT 
activity in a Scenedesmus obliquus mutant with low photo- 
respiratory activity under normal C0 2 condition which is 
comparable to tissues with rapid cell proliferation. The 
phosphorylated pathway enzymes are important in 
proliferating plant and animal tissues where there is a 
high serine requirement. Snell (1985) reported that the 
rat neoplastic tissues have higher serine hydroxymethyl 
transferase and phosphoserine aminotransferase activi- 
ties, perhaps reflecting the increased demand for one- 
carbon fragments ultimately required for DNA synthesis. 
Our findings are consistent with the previous reports. 
We provide molecular evidence that mRNA of PSAT is 
preferentially expressed in roots from light-grown plants 
(Figure 7), especially in the meristem tissue as revealed by 
in situ hybridization results (Figure 8). A minor amount of 
PSAT mRNA expression in the shoots from light-grown 
plants may be due to the non-photosynthetic cell types 
that co-exist in the leaves and stems. 

In contrast to the preferential expression of PSAT, mRNA 
abundance of H-protein (a subunit of GDC) and SHMT, the 
two enzymes which are responsible for the photorespira- 
tory pathway in shoots of light-grown plants, far exceeded 
their expression in roots or dark-grown shoots (Figure 7). 
Again, this finding supports the idea that the photorespira- 
tory pathway predominates in photosynthetic tissues. The 
phosphorylated pathway may play an important role in 
supplying serine to the root. However, further physiological 
studies of the fluxes through the two pathways have to be 
compared to confirm the above result. Further investi- 
gations are necessary to clarify the interaction of the two 
pathways at a molecular and physiological level precisely. 

Immunolocalization of PSAT protein (Figure 10) provided 
further information that PSAT protein is mainly detected 
in cells in and around the vascular bundle, and is 
preferentially associated with xyiem in leaf, stem and root 
tissues. Since these cells do not take part in photo- 
synthesis, the phosphorylated pathway may therefore play 
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an important role in supplying serine requirement to these 
tissues, It appears likely that the increase in protein syn- 
thesis is partly regulated at the mRNA level. 

Meanings of plastidic localization of PS AT 

Reynolds and Blevins {1986} reported that PSAT was 
localized in the proplastid, a conclusion based on enzyme 
isolation from the plastidic fraction. However, Larsson and 
Albertsson (1979) detected 90% of PSAT activity outside 
the chloroplasts in leaves in contrast to phosphoserine 
phosphatase, the enzyme responsible for the final step in 
the phosphorylated pathway which was of exclusively 
chloroplastic origin. Thus, Larsson and Albertsson (1979) 
proposed that the phosphohydroxypyruvate formed by 
phosphoglycerate dehydrogenase was exported from 
chloroplasts to form phosphoserine which was then 
imported into the chloroplasts to serve as a substrate for 
phosphoserine phosphatase. We provide the first sub- 
cellular localization evidence which shows the targeting of 
spinach and Arabidopsis PSAT proteins into leaf chloro- 
plasts. The function of the transit-peptides located at the 
N-terminus of the cDNAs were confirmed, 

The Southern blot analysis suggested the presence of a 
single gene corresponding to the isolated cDNA, but we 
do not exclude the possibility of related sequences encod- 
ing for other isoforms of PSAT in Arabidopsis since several 
weakly hybridizing bands were also observed in addition to 
the expected bands based on the genomic DNA sequence. 
However, it is premature to suggest the isolated gene is a 
main gene without first knowing whether other genes 
encoding PSAT species exist, and how many, as well as 
their physiological roles. The presence of another isoform 
of PSAT which is not localized in the plastid remains a 
possibility and requires further investigation. 

Experimental procedures 

Plant materials 

Arabidopsis thaliana ecotype Columbia seeds were germinated 
and grown on GM agar plates (Valvekens eta/., 1988) under 16 h/ 
8 h light and dark cycles at 22°C for 3 weeks. For the dark-treated 
seedlings used for Northern analyses, 2-week-old seedlings were 
wrapped in aluminium foil and subsequently grown for another 
1 week before RNA extraction was carried out. 

Isolation ofcDNA and genomic clones 

For the isolation of cDNA which encodes for PSAT, approxi- 
mately 2.5 X 10 5 amplified plaques of the Xgtll cDNA library of 
A. thaliana ecotype Columbia were subjected to screening with 
the 32 P-labelled probe synthesized from the cDNA insert of the 
Arabidopsis EST, FAFL52 (accession no. ATT4557). The Arabidopsis 
cDNA clone, CPSAT-5, was used to screen for genomic clones from 



an Arabidopsis EMBL3 SP67T7 library (CLONTECH). Approximately 
2 x 10 5 amplified plaques were screened. 

Hybridization of the membranes (Hybond N + Amersham) was 
carried out at 65°C in 5 X SSPE (0.9 m NaCI, 0.05 m sodium phos- 
phate pH 7.7, 5 mM EDTA), 0.5% SDS, 5 x Denhardt's solution and 
25 ng ml -1 salmon sperm DNA. Membranes were washed at 
maximum stringency in 1 x SSPE, 0.1% SDS for 10 min twice and 
final washing was conducted at 65°C in 0.1 x SSPE, 0.1% SDS for 
10 min, and then exposed to Fuji X-ray film. 



Nucleic acid preparation and blot analyses 

Genomic DNA was extracted from the leaves of 3-week-old 
seedlings as described by Dellaporta era/. (1983). Isolation of total 
RNA was performed by a modified guanidine-HCi method as 
described by Sambrook etal. (1989) from the leaves and roots of 
3-week-old seedlings. DNA and RNA blots were hybridized with 
32 P-labelled probe synthesized from cDNA insert fragment of the 
isolated clone. To investigate the mRNA expression levels of GDC 
and SHMT, 32 P-labeled probes synthesized from cDNA inserts of 
Arabidopsis EST clones, 200K16T7 and 111M16T7, respectively, 
were used. To verify equivalent loadings of RNA on blots, 
membranes were hybridized with a 32 P-labelled probe synthesized 
from rice rDNA (pRR217). Relative values of mRNA were calculated 
based on the hybridization intensities of specific signals on blots 
quantified by a BAS-2000 image analyzer (Fuji). 

Restriction fragment length polymorphism (RFLP) mapping was 
carried out by 30 recombinant inbred lines (Lister and Dean, 1993}. 
Hybridization and washing were carried out as described above 
except that the final washing condition was reduced to 0.5 X SSPE, 
0.1% SDS for 10 min at 65°C. 32 P-labelled probe was synthesized 
from genomic clone, GPSAT-45, and used for hybridization. The 
map distance was calculated based on the RFLP profiles generated 
by Hhal 



Overexpression and purification of recombinant enzyme 

A general method of DNA engineering was followed according to 
Sambrook etal, (1989). Nco\ sites were created on both ends of 
the coding region by polymerase chain reaction (PCR) engineering 
using synthetic primers: AB1 (5'-GTCTACCATGGGCTCCGTC- 
GGATCCCAAG-3') and AB2 (5'-CACGTCCATGGAGAAAGATTT- 
TGAT-3'). The engineered DNA fragment was inserted into the 
A/col site of pET3d or pET32a(+) (Novagen-TaKaRa, Kyoto, Japan), 
in which the cDNA was placed under a strong 010 promoter in 
both sense and antisense orientation. The plasmids were then 
introduced into £ coli BL21 (DE3) pLysS or AD494 (DE3) pLysS in 
which the gene for lysogenic T7 RNA polymerase under /acUV5 
promoter is induced by isopropyl-1-thio-p-D-galactopyranoside 
(IPTG). The recombinant protein accumulated in the insoluble 
fraction of E. coli AD494 extracts was purified using HlS-BIND R 
buffer kit (Novagen) following the instructions as described in 
the manual. 

Transformants of E coli KL282 {tonA22, phoA4{Am) (serSH 
serCW), serS13, ompF627, supD32 {serUI32), relAl, pit10, spoT1, 
T 2 H) mutant (Low et al., 1971) with plasmids pTV118N and pCPSAT- 
AB13T, respectively, were tested for their ability to grow on M9 
minimal medium. 



Protein harvesting from Arabidopsis 

Approximately 0.1 g of leaf tissue was added to 300 jil of lysis 
buffer (2 mM Tris-HCI, pH 8.0; 0.14 m NaCI; 1 mM phenylmethyl- 
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sulfonyl fluoride) and homogenized with a pestle in an Eppendorf 
tube, followed by centrifugation at 10 000 g for 5 min at 4°C. The 
supernatant was used for protein quantitation and SDS-PAGE. 



Enzyme assay 

The assay for PSAT activity is based on that described by Duncan 
and Coggins (1986). Assay mixtures contained 50 mM Tris-HCI 
(pH 8.2), 32 mM ammonium acetate, 2 mM glutamate, 0.2 mM 
NADH, 2.5 mM phosphohydroxypyruvate, 2 units of glutamate 
dehydrogenase and 10 \i\ protein extract. The reaction at 25°C 
was initiated by the addition of phosphohydroxypyruvate, and the 
absorbance change at 340 nm was measured. 



Subcellular organelle localization of PSAT by GPP 

To verify the subcellular localization of PSAT in plant cells, a 
228 bp PCR-amplified fragment of PSAT from Arabidopsis using 
primers 5'-AGTGTGTTCGACATGGCGGCTACGACGAACT-3' and 
5'-CCTGGCCATGGCAAGTTGAAGACACGTTCT-3'; and a 216 bp 
PCR-amplified fragment of PSAT from spinach using primers 5'- 
GAGAAGTCGACATGGCAATGGCGGCCACCT-3' and 5'-CCTGGC- 
CATGGCAAAGTTGAAGACGCGTT-3', were subcloned into 
plasmid CaMV35S-sGFP(S65T)-NOS3' (Chiu era/., 1996) with a 
CaMV 35S-promoter at its 5'-end, GFP as reporter gene and 
3'NOS as transcription terminator. Plasmid construct, 35SQ-TP- 
sGFP(S65T) (Chiu era/., 1996) carrying a transit peptide sequence 
obtained from the ribulose-1,5-bisphosphate carboxylase small 
subunit polypeptide of Arabidopsis (Krebbers era/., 1988), was 
used as a positive control. These plasmids were used for sub- 
sequent particle gun bombardment. 

Particle gun bombardment was carried out using the Helios 
Gene-Gun System (Bio-Rad) following standard protocol provided 
by the supplier. Plants were incubated for 20 h under illumination 
conditions at 22°C after bombardment. Individual leaves were 
viewed with a fluorescent microscope (BX50-FLA, Olympus) using 
the Chroma's Dual Band filter, FITC & TRITC (Olympus). Photo- 
graphs were taken using Fujichrome ISO 100 film. 



Immunolocalization of PSAT 

Immunolocalization of PSAT protein was performed on 3-week- 
old A. thaliana ecotype Columbia seedlings. The plant tissues 
were initially fixed in 4% formaldehyde, 50% ethanol and 5% 
acetic acid overnight at 4°C. Fixed tissues were dehydrated and 
embedded in polyethyleneglycol (PEG)#1540 (Kanto Chemical) as 
described by Marrison and Leech (1992). Sections of 8 ^m were 
mounted onto slides coated with 3-aminopropyltriethoxysilane 
(Sigma) and left to dry overnight at 42°C. The sections were 
soaked in phosphate-buffered saline (PBS; 0.16 m NaCi, 8 mM 
Na 2 HP0 4 , 2,7 mM KCI, 1.5 mM KH 2 P0 4 ) to remove PEG and 
incubated overnight at 4°C with 100 jal of 1 : 200 dilution of rabbit 
antiserum raised against PSAT in 0.5% (w/v) BSA/PBS. After 
sequential washing with 0.5% (w/v) BSA/PBS for 15 min, with 
0.01% (v/v) Tween 20/PBS for 15 min and with PBS for 15 min, 
sections were incubated with FITC conjugated goat antirabbit IgG 
(Sigma) in a diluted concentration recommended by the supplier, 
in 0.5% (w/v) BSA/PBS at room temperature for 2 h. Slides were 
mounted with a drop of Vectashield and viewed by a fluorescent 
microscope (BX50-FLA, Olympus) with Chroma's Dual Band filter, 
FITC & TRITC. Photographs were taken using Fujichrome ISO 
400 film. 



In situ hybridization 

In situ hybridization was carried out on 3-week-old A. thaliana 
seedlings, which were fixed in 4% formaldehyde, 50% ethanol 
and 5% acetic acid for 3 h at room temperature. Fixed tissues 
were dehydrated and embedded in paraffin wax according to 
standard procedures (Angerer and Angerer, 1992). Ten ]xm sections 
were mounted onto slides coated with 3-aminopropyltriethoxysil- 
ane and pre-treated for hybridization according to Angerer and 
Angerer (1992). 35 S-UTP-labelled sense and antisense RNA probes 
were generated by run-off transcription with T7 and T3 RNA 
polymerase (Promega). Labelled RNA probes were hydrolyzed to 
an average length of 300 nucleotides. The hybridization mix 
contained 35 S-labelled mRNA <5X 10 6 cpm slides -1 ), 10 mM Tris- 
HCI (pH 7.5), 50% (v/v) formamide, 0.3 M NaCI, 1 mM EDTA, 
150 |ug ml -1 yeast tRNA, 1 X Denhardt's, 10% (w/v) dextran sulfate, 
and 70 mM dithiothreitol. RNase treatment washing steps were 
performed as described by Angerer and Angerer (1992). Slides 
were coated with Hypercoat emulsions (Amersham) and then 
developed with Kodak D-19. After staining with 0.05% (w/v) toluid- 
Ine blue and following dehydration, the slides were mounted in 
PolyMount (Polysciences Inc., Warrington, USA). Photographs 
were taken in a microscope (BX50, Olympus) using dark-field 
optics and Fujichrome ISO 100 film. 



Miscellaneous techniques 

DNA subcloning and sequencing, sodium dodecyl sulfate (SDS)- 
polyarylamide gel electrophoresis (PAGE), Western blot analysis, 
protein quantitation and primer extension were carried out as 
described by Sambrook era/. (1989), 
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The mel-32 gene in the free living soil nematode Cac- 
norhabditis elegans encodes a serine hydroxymethyl- 
transferase (SHMT) isoformu Seventeen ethylmethane- 
sulfonate (EMS)-indnced mutant alleles of roe^2(SHMT) 
have been generated, each of which causes a recessive 
maternal effect lethal phenotype. Animals homozygous 
for the SHMT mutations have no observable mutant 
phenotype, but their offspring display an embryonic le- 
thal phenotype. The Mel-32 phenotype has been rescued 
with a transgenic array containing only mel-$2{Sl$MT) 
genomic DNA, Heteroduplex analysis of the 17 alleles 
allowed 14 of the mutations to be positioned to small 
regions. Subsequent sequence analysis has shown that 
16 of the alleles alter highly conserved amino acids, 
while one allele introduces a stop codon that truncates 
two thirds of the predicted protein. we£32(SHMT) has a 
55-60% identity at the amino acid level with both iso- 
forms of SHMT found in yeast and humans and a 50% 
identity with the Escherichia coli isoform. The C. el- 
egans mel-32 mutation represents the first case where 
SHMT has been shown to be an essential gene. 



Serine hydroxymethyltransferase (SHMT) 1 is a highly con- 
served, ubiquitous, pyridoxal 5'-phosphate (PLP)-containing 
enzyme that catalyzes the reversible conversion of serine and 
tetrahydrofolate to glycine and 5,10-methylenetetrahydrofo- 
late (1), The 0-carbon of serine is the major source of one- 
carbon units in the one-carbon metabolic pool (2). The methyl 
group passed on through the folate cofactor of SHMT is used in 
thymidylate, methionine, lipid, and purine biosynthesis. 
SHMT also catalyzes many secondary reactions, such as amino 
acid transaminations (for a full review of the reactions cata- 
lyzed by SHMT, see Ref, 1). 

Eukaryotic organisms have two nuclear genes that encode 
two distinct isoforms of this enzyme, a cytosolic form and a 
mitochondrial form, Recent evidence suggests that there is a 
subcellular partitioning of the reactions catalyzed by SHMT, 
with the serine to glycine conversion occurring in the mitochon- 
dria, and the glycine to serine conversion occurring in the 
cytoplasm (3). A comparison of the determined and predicted 
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amino acid sequences of many SHMTs reveals a striking degree 
of homology between all the known forms of this enzyme (2, 4, 
5), These alignments show that the amino and carboxyl termini 
of SHMT are less conserved but that the middle two thirds of 
the sequence have long stretches of very high identity. 

Serine hydroxymethyltransferase levels are elevated in rap- 
idly proliferating cell lines and tumors (6, 7). When lympho- 
cytes are treated with a mitogenic stimulus, the enzymatic 
activity of SHMT is increased and the incorporation of the 
/3-carbon of serine into DNA is increased (6). SHMT activity is 
also increased during the S phase of the cell cycle, suggesting 
that a product of SHMT activity is utilized during cell division. 
Conversely, when cells stop proliferating, SHMT levels decrease, 
Retinoic acid treatment of P19 embryonal carcinoma cells stops 
proliferation and stimulates differentiation. Furthermore, it has 
been shown recently that retinoic acid causes a 50% decrease in 
SHMT transcript levels (7), Based on these ftndings, serine hy- 
droxymethyltransferase has been proposed as a potential chem- 
otherapy target (8), A chemical or drug that decreases SHMT 
activity may cause rapidly proliferating tumor cells to quiesce. 

Prior to this study, the only observable phenotype of SHMT 
deficiency was glycine auxotrophy. In Escherichia coli } when 
the single copy of SHMT iglyA) is mutated, glycine auxotrophy 
results (9). In the yeast SaccIiaromyce$ cerevisiae, glycine aux- 
otrophy is observed only when both forms of SHMT and a third 
gene,g/yA, are mutated (4) suggesting afunctional redundancy 
in the glycine synthesis pathway. In addition, a line of Chinese 
hamster ovary cells that lack mitochondrial SHMT activity are 
glycine auxotrophs (3, 10). This deficiency is rescued when 
the cells are transfected with a cDNA encoding the human 
mitochondrial SHMT (11). 

This report presents the cloning and characterization of a 
Caenorhabditis elegans SHMThomolog called mel-32. Mutations 
in mel-32 result in a maternal effect lethal (Mel) phenotype. The 
C. elegans mel-32 mutation is the first reported case of a serine 
hydroxymethyltransferase deficiency causing lethality. 

EXPERIMENTAL PROCEDURES 
Growth and Handling ofC. elegans— Mi strains of C. elegans were 
grown at 20 °C on nematode growth medium plates streaked with & 
coli OP50 as a food source. Standard genetic manipulation followed 
previously described protocols (12), 

Computer Analysis-* The analysis of sequence data, sequence com- 
parisons, and data base searches were performed with ACeDB (a C. 
elegans data base) U$), 2 the BLAST 3 (provided by the NCBI server) and 
FASTA programs {14, 16), CLUSTALW (16), and MacDNASIS Pro 



2 ACeDB documentation, code, and data are available from anony- 
mous FTP servers at Hrmm.lirmra.fr, cele.mrc-lmb.cam.ac.uk, and 

n °3 ^rBLAST^program is available via the World Wide Web (http:// 
www.nchi.nlm.nih.gov/blast). 
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(Hitachi Software Engineering Co., Ltd.). PCR primers were designed 
with the aid of01igo(m 

Cloning the C. elegans SHUT Gene— The cosmid clone of C05D11 
(GenBank accession number U00048) was kindly provided by Dr, Alan 
Coulson (Medical Research Council, Hinxton, United Kingdom), A com- 
puter restriction analysis of C05D11 (MacDNASlS) revealed that cod- 
ing elements 11 (SHMT) and 13 could be isolated as a 5052-base pair 
(bp) Pvull*Scal (Life Technologies, Inc., Pharmacia Biotech Inc.) frag- 
ment (residues 36311-41363 of the GenBank entry). The plasmid pGV9 
(Fig. 2.) was constructed by ligating the Pvull-Scal fragment into 
KcoRV (Life Technologies, Inc.)-cufc pBluescript II K8 + (Stratagene). 
The plasmid pC05.ll (Fig, 2.) was constructed by cutting pGV9 with 
Pstl (New England Biolabs) and reKgating, leaving a 3196-bp fragment 
of C05D11 (residues 36311-39607), containing only open reading frame 
11 (SHMT), in pBluescript. 

Mutant Rescue— The subclones containing the SHMT gene were 
injected into the syncytial gonad of adult wild-type (N2) C. elegans 
hermaphrodites together with the dominant marker rol-6(sul006) (18, 
19) contained on plasmid pCesl943. 4 Stable transgenic strains, those 
expressing the roller phenotype in successive generations, were used in 
the rescue experiments. 

Heteroduplex Analysis— DNA from individual C. elegans heterozy- 
gous for each of the SHMT mutations was isolated as described by 
Barstead et al (20) with the modifications of Williams et al. (21). PCR 
using Taq polymerase (BioCan Scientific) was performed as follows; 6 
pmol of each of five sets of primers (synthesized by DNAgency, Malvern, 
PA) were used (Table I), The samples were incubated at 94 *C for 1 min 
on an Idaho Technology 1605 Air Thermo-Cycler before commencement 
of 30 cycles of amplification (94 <C for 10 s, 59 °C (62 °C for the D primer 
set) for 20 s, and 72 °C for 40 s). Following a 2-min incubation at 72 *C, 
1,25 fd of BDTA (0.1 m, pH 7.5) was added to terminate the reaction. 
Heteroduplex analysis was performed using a mutation detection en- 
hancement gel matrix (J.T. Baker) following the method of Nybroek et 
al. (22) with the following changes; a heteroduplex denaturation/rean- 
nealing profile of 95 °0 for 3 min, 85 °C for 3 min, 75 *C for 5 min, 65 °C 
for 5 min, 56 °C for 5 min, and 37 *C for 5 min was used. The hetero- 
duplex DNA was then resolved on a 0.5 X mutation detection enhance- 
menfc gel at 400 V for 16-20 h, 

DNA Sequence Analysis— "DNA from individual nematodes homozy- 
gous for the me/-32(SHMT) mutations was isolated and amplified with 
the same primers used in the heteroduplex analysis. A total of 0.6 /il of 
this template DNA was incubated at 95 *C for 3 min on a Precision 
Scientific GTC-2 Genetic Thermo Cycler before commencement of 35 
cycles of amplification (94 *C for 45 s, 59 °C (62 *C for the D primer set) 
for 30 s, and 72 *C for 1 min) followed by a polishing step of 72 fl C for 7 
min. The PCR products from two separate reactions were pooled, puri- 
fied by agarose gel electrophoresis, and collected with a Qiagen QIA- 
quick gel extraction kit. A total of 100-200 ng of each PCR product was 
sequenced on both strands using FS Taq terminator chemistry (Applied 
Biosciences) on a Perkin-BImer GeneAmp PCR System. The reactions 
were run on an Applied Biosciences model 373A automated DNA se- 
quence analyzer located at the Nucleic Acid-Protein Service Unit, Uni- 
versity of British Columbia. 

RESULTS AND DISCUSSION 

Identification of a C, elegans Homolog of SHMT— A search of 
the ACeDB program revealed one SHMT homolog, C05D11.11, 
located on chromosome III within the area defined by the 
cosmid C05D11. The position of the CeSHMT gene was based 
on both a Genefmder 6 prediction and on five partial cDNAs 
isolated as part of the C. elegans genome sequencing project. 
The predicted gene contains four exons coded in 1599 bp of 
genomic sequence. The predicted 484-amino acid protein se- 
quence was used to search the GenBank data bank with the 
FASTA algorithm (14, 15), and the closest homologs were rab- 
bit and human cytosolic SHMTs (61.7% and 61.3% identity, 
respectively, in a 470-amino acid overlap). 

A sequence comparison of SHMTs shows the highly con- 
served nature of this protein and reveals many conserved do- 
mains (Fig. 1). The amino- and carboxyl-terminal 50 amino 
acids are the least conserved, and the central three-quarters of 



the protein contain large stretches of completely conserved 
amino acids. The PLP cofactor binding site (residues 301-305 
in the C. elegans protein) and the active site lysine (residue 
306) are conserved in every case. 

Rescue of mel-32 with C. elegans SHMT— Out laboratory is 
in the process of constructing a transgenic library of sequenced 
cosmids that can be used for high resolution genetic mapping 
(23), Stable transgenic arrays are generated, which can act as 
cosmid sized duplications in rescue experiments. If the genomic 
DNA present in the extrachromosomal array rescues the reces- 
sive lethality of an essential gene then the wild-type copy of the 
mutation must be present in the DNA denned by the trans- 
genic. Previous rescue experiments in our laboratory (23) have 
placed five essential genes in the genomic region defined by the 
cosmid C05D11. The genes let»7l3, let-721, let-725, let*756, and 
mel-32 are all rescued by C05D11, making these genes candi- 
dates for potential mutations in CeSHMT. 

The mel-32 mutant phenotype was rescued with CeSHMT 
genomic DNA, indicating that mel-32 encodes SHMT. All res- 
cue experiments were performed with the canonical allele of 
mel-32, s2518 t which was isolated in an EMS mutagenesis 
screen for maternal effect lethals. 6 Subsequently, mel-32 was 
found to be allelic with 16 EMS-induced Mel alleles from a 
collection of maternal effect embryonic lethal mutations on 
chromosome III isolated by H. Schnabel and R. Schnabel. 7 
Hermaphrodites homozygous for the mel-32 mutations have no 
observable mutant phenotype, but their self-fertilteed offspring 
display an embryonic lethal phenotype and arrest at about the 
100 cell stage. The Mel phenotype of mel-32 was rescued with 
pGV9, which contains coding element 11 (SHMT) and 13 (Fig. 
2). To determine in which of these two genes the mutations 
reside, a smaller subclone, pC05.11 (Fig. 2), containing only 
gene 11 (SHMT) was constructed. This subclone, pC05.ll, gave 
a partial rescue of the Mel-32 phenotype. From tfieZ-32;pC05,ll 
transgenic hermaphrodites, a small number of progeny hatch 
and grow to adulthood but are not themselves fertile. pGV9 
contains the entire 1254 bp of intergenic sequence between 
gene 11 (CeSHMT) and gene 13, while pC05,ll contains only 
416 bp upstream of the CeSHMT ATG start codon. The partial 
rescue by pC05.11 suggests that some important regulatory 
sequences are missing from the smaller subclone. 

Heteroduplex Analysis of meU32(SHMT) Mutations—Tho 17 
alleles of mel-32 were analyzed by heteroduplex analysis to de- 
fine more precisely the regions where the mutations occurred. 
Five sets of overlapping PCR products were generated from an- 
imals heterozygous for each allele. The primers used for PCR are 
listed in Table I, the exact location of each is shown in Fig. 3, and 
the overlap of each product is shown in Fig. 4. Each amplified 
DNA product was heat-denatured and allowed to cool slowly. Any 
PCR product that contains a mutation will have a mixture of 
homoduplexes and heteroduplexes, paired wild-type and mutant 
strands that contain base pair mismatches. These duplexes were 
run on a special mutation detection enhancement gel (see "Ex- 
perimental Procedures*), which can reveal a single base pah- 
mismatch in a short strand of DNA (24). The presence of multiple 
bands indicates a mutation in that particular overlapping frag- 
ment. Using this procedure, 13 out of the 17 mutations were 
placed into specific regions of the gene. 

These mutations were distributed as follows. Region A (Fig. 
4) contained one mutation: U473 (which was also detected in 
region B; see below). Region B contained six mutations: U473 
(see below), tl555, tl597 t tl66$, tl$79, and s2$l$. Region C 
contained three mutations: tW52> tl616, and tI63X, Region D 



4 S, J. M. Jones and W. B. Barbazuk, personal communication. 
6 P, Green and L. Hillier, manuscript in preparation. 



0 G. Vatcher, unpublished results. 

7 H. Schnabel and R. Schnabel, unpublished results. 
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C05D1X.11 MADRQVHTPLAKVQRHKYTNNENXLVDHVEKVDPEVFDIMKNEKKRQRRGL 51 

glyc human MTMPVNGAH KDADLWS S HDKMLAQ P LKDSDVEVYNIXKKESNRQRVGL 

glyc yeast MPYT&SDAHHKLXTSHLV DTDPEVPSIXKDBXERQKHSI 

glym human MAIRAQHSWMQTQTGEANRGWTGQE-SLSJDSDPEMWELLQREKDRQCRGL 

glym yeast M FP RAS ALAKCMATVH RRGLLT5 GAQS LVS K PVS EGD P EM FDI LQQERHRQKHS I 

E.OOli SHMT MLKBEMN— lAPYDABI^AMEQEKVRQEEHI 

consensus d DpE. i » B RQ 

V Q KV 

C05DU.11 ELXASBNmK&VMDALGSAMCmYSEGYPG^ 111 

glyc human ELIABENFASI^VLEALGSCLlWKYSEGYPGQRYYGGTEFlDELE'rLCQKRALOAYKIiDP 

glyc yeast DLIASENFTSTSVFDALGTPLSNKYSEGYPGARYYGGNEHIDRMEILCQQRALKAFHVTP 

glym human ELrASENFCSRAAliEALGSCLNNKYSEGYPGKRYYGGAEWDEIELLCQRRAIiEAFDLDP 

glym yeast TlilPSENFTSKAVMDLSGSELQNKYSEGYPGERYYGGNEIXDKSESLCQARALEItYGLDP 

E, cOii SHMT ELXASENYTSPRVMQAQGSQLTNKYAEGYPGKRYYGGCEYVDIVEQLAtDRAKELFGAD- 

consensus eLXaSENytS av a Ge 1 NKYsEGYPG RYYGG E iD E haq RAle £ ldp 

V D F E 

C05D11, 11 AKWGVNVQPLSGSPANFAVYTAIVGSNGRIMGLDLPDGGHLTHGFFTPA-RKVSATSEFF 170 

glyc human QCWGVNVQP Yfl GS PAN FAVYTALVE PHGRXMGLDLP DGGHLT HG FMTDK- KKI S ATS IFF 

glyc yeast DKWGVNVQTI.SGSPANLQVYQAIMKPHERLMGt»YI J PDGGHLSHGYATEN-RKrSAVSTYF 

glym human AQWGVNVQPYSGSpANLAVYTALLgPHDRIMGLDLPPGGHLTHGYMSDV" KRIS ATS IFF 

glym yeast AKWGVWQPLSGAPANLYWSAIMMVGERIMGLPIjPDGGIILSHGYQIiKSGTPISFISKYF 

E.coli SHMT YANVQPHS GS QAN FAVYTALIi E P GDT VLGMHIiAHGGH JjTHG S P VN F 1 SGKLY 

consensus WgvNVQp SGspAN aVYtA, , £ , mGldLpdGGHLtHG Is a s £ 



C05DU.11 
glyc human 
glyc yeast 
glym human 
glym yeast 
E.coli SHMT 
consensus 



CO5011.11 
glyc human 
glyc yeast 
glym human 
glym yeast 
E.coli SHMT 
consensus 



stop E 
QSLPYKVDPTTGLIDYDKLEQMAMLFRPKAIIAGVSCYARHLDYERFRKIATKAGAYLMS 230 
ESMPYKVNPDTGYINyDQLEBNARLFHPKLIIAGTSCYSRNI.EYARLRKIADENGAYLMA 
ESFPYRVNPETGIIDYDTLEKNAILYRPKVl.VAGTSAYCRLrDYKRMRElADKCGAYI»MV 
BSMPYKkNpKTG&IDYNQLAI/TARLFRPRLI I AGTSAYARLI D YARMREVC DEVKAH E»JjA 
OSMPYHVDHTTGLIDYDNI.QVIAKAFRPKVIVAGTSAYSRLIDYARFKEISQGCGAYLMS 
N I V P YGI D- AT GH I D YADLEKQAK EHKPKMX I GGFSAYS GWP§AKMREI ADS XGAYLFV 
s PY v p TG IdYd Le A IfrPk iiaGtSaY.r .dyar reiad gAytm 

F Y T 

DMAHXSGLVAAGWPSPFEYSDVVTTTTHKSLRGPRGArflFYRKGVRSTN-AKGVDTliYD 289 
D^ISGLVAAGVVPSeFEHCHVVTTTTHKTLRGCRAGMIFYRKG^SWPKTGKEII,^ 
DMAHl SGtXAAGVX PS P FEYADI VTTTTHKSLRGPRGAMI FFRRGVRS INPKTGKEVLYD 
DhlAKISGLVAAKYIPSPFKHADXVTTTTHKTI.RGARSGWFYRKGVKAVDPKTGREIPYT 
DMAHXSGLVAANVVP5PFEHSP1V TTTTHKSL RGPRGAMIFFRKGIKSVTKK-GKEXPYE 

DMAHVAGIiVAAGVY PN P VP HAHVV frTT TH KXj LAG P RGG LI LAKGGS BE LY KKUf 

DMAH i s GL vAAg V PsPfeh.d. VTTTTHK, LrGpRg .If rkGv s k g e y 



i 



tr 

£ 
§ 

31 



C05D11. 11 LEEKXWSAVFPGLQGGPHNHTIAGIAVALROCLSEDFVQYGEQVLKNAKTIAERMKKHGY 349 

glyc human LESLXNSAVFPGtaGGPRNHArAGVAVALKQAMTLEFKVYQHQVVANCRAI/SEA^TELGY 

glyc yeast t»ENPINFSWPGHQGGPHNHTIAAIiATALKQAATPEFKEYQTQVLKNAKAiESEFKNLGY 

glym human fbdrxnfavfpslqggphmhaiaavavalkqactpmfrbyslqvlknaramadallergy 

glym yeast LEKKINFSVFPGHQGGPHNHTXGAMAVALKQAMSPEFKEYQQKIVONSKWFAQELTKMGY 

B.coli SHMT S AVFPGG^GGPI2^HVIAGKAVALKEAMEPEFKTYQQQVAJKNAKAMVEVFX*ERGY 

consensus le in aVFPg QGGPlnH la AvALkqa peFk Yq qv kNaka GY 



C05D11.11 
glyc human 
glyc yeast 
glym human 
glym yeast 
E.coli SHMT 
consensus 

C05Dll.ll 
glyc human 
glyc yeast 
glym human 
glym yeast 
E.coli SHMT 
consensus 

C0SD11.11 
glyc human 
glyc yeast 
glym human 
glym yeast 
E.coli SHMT 
consensus 



AIiATGGTDNHLLLVDLRPIGVEGARAEllVLDLAHIACNKNTCPGD-VSALRPGGIRLGTP 408 
KIWGGSDNHLXLVDliRSKGTDGGRAEKVLEACSIACNKNTCPGD-RSALRPSGLRLGTP 
RLVSNGTDSHM^VSLREKGVDGARVEYICEKINXALNKNSIPGD-'KSALVPGGVRXGAP 
S INS G GTDN H LVX»VDI*RP KGLDGARAERVJjEIjVS X TANKNTC P G D-RSAI T£ GG LRLGA P 
KLVSGGTDNHLIVrDI.SGTQVDGARVETILSALNIAANKNTXPGD-KSALFPSGI)RIGTP 
KWSGGTDNHLFLVDLVDKNLTGKEADAALGRAWITVNKNSVPNDPKSPFVTSGXgVGTP 
lvsgGtDnttl Ivd&r kg dGarae 1 la NKNt PgD Sal p G.R.G.P 

ALTSRGFQEQDFEKVGDFIHEGVOXAKKYNAEAG KTIiKDFKSFTETNEPFKKDVAD 4 64 

ALTSRGLXjEKDFQKVAHFXHRGIELTLQXQSDTGVR — ATLKEFKERXAGD-KYQAAVQA 

AMTT RGMGEE DFH RI VQYIN KAVE FAQQVQQ S I* P KDACRXiKDFKAKVDEGSDVXiNT 

ALTSRQFREDDFRRWDFXDEGVNIGLEVKSKT AKLQDFKSFLIiKDSETSQRLAN 

AMTT RGFGRE EFS QVAKYX DS AVKXiAEN LKT L EPT TKLDARS RLNE FKKLCN ES S - EVAA 

AXTP^GFKEAEAKELAGgMCDVLDSXNDEAVIER IKG 

A.T Rgf e df v i v k t 



LAKRVEEFSTKFEX PGNETF 
X/REEVESFASI/FPI»PGLPDF 

WKKEXYDWAGEYPLAV 

liRQRVEQFARAFPMPGFDEH 
LSGEXSKWVGQYPVPGDI— 
K— - VX/DXCARYPVYA 

l v .p.pg 
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Fig. 1. Comparison of the amino acid sequences of selected SHMTs. The aligned sequences are C. elegans toel-32 (C05D1W), human 
cytosolic'tetyc human) and mitochondrial (glym human), & cerevislat cytosolic (glyc yeast) and mitochondrial (glym yeast), and £ coli GlyA (Exoli 
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Fig. 2, SRMT subclones. The genomic subclones used to rescue 
mel$2 are shown. Nucleotide numbers correspond to the GcnBank 
entry for C05D11 (U00048). Restriction sites used for subcloning are 
indicated. 





Table I 






PGR primers 




Primer 


Sequence 


Product length 






bp 


Al 


ATGCATATACGTGTTGCC 


399 


A2 


TAATACCGAOCTCCAGGG 




Bl 


GAAGCGGCAGCGTCGTGG 


40S 


B2 


GAGACTTTGCGAGCTGGG 




01 


TATGGGCCTCGATCTTCC 


483 


C2 


GTGTCAACTCCCTTTGCG 




Dl 


CAACGACGCACAAGTCGC 


449 


D2 


GAGACGAATGCCTCCTGG 




Bl 


CGGAGTTGAAGGAGCTCG 


474 


B2 


AAGTGATAAAATGGCGGG 





contained three mutations: tUB6 y tl474> and tl576. Region B 
contained a single mutation: tl632. For allele tl666> no muta- 
tions were detected in regions A, C, D, and E, but a single PCR 
product could not be obtained for region B. It was therefore 
assumed that the lesion in this allele was contained in region 
B, which was later confirmed by direct sequence analysis. The 
position of the molecular lesions in alleles t!620 } tl607> and 
tl671 could not be detected using this procedure. 

Sequence Analysis of meU32(SHMT)-~1!h$ same oligonucleo- 
tide primers used to generate PCR products for the heterodu- 
plex analysis (Table I) were used to amplify DNA from homozy- 
gous mutant animals from the region in which the mutation for 
each allele was detected. For alleles U520, U607, and m71, in 
which heteroduplexes were not observed, DNA from all five 
regions was amplified from mutant strains. The PCR products 
were then directly sequenced in both orientations (see "Exper- 
imental Procedures"), 

The sequence analysis results are summarized in Table II 
and Fig. 4. Allele tl63l contains the only nonsense mutation 
identified for mel-32, The codon for glutamine 171 is mutated 
into an amber stop codon. Any truncated protein produced in 
this mutant would lack the terminal two thirds of the enzyme, 
including the active site. This allele is assumed to be a null> 
The phenotype of this mutant is indistinguishable from the 
other 16 alleles, suggesting that all of the alleles may be null, 
or that all reduce the SHMT activity below the threshold re- 
quired for embryonic survival 

Alleles tl466 and tU74 both contain the same base pair 
mutation, which changes glycine 313 to glutamic acid. In allele 
tl616> alanine 268 is changed to threonine. The amino acids at 
positions 313 and 268 of the SHMT consensus sequence (Fig. 1) 
are always either a glycine or an alanine. It is assumed that 



replacing a small, neutral amino acid with a larger acidic or 
polar one warps the secondary structure of SHMT enough to 
disrupt its enzymatic function. 

Eukaryotic forms of the SHMT holoenzyme exist as tetram- 
ers of identical subunits, and the predicted secondary structure 
contains alternating helices and strands, placing SHMT in the 
o/p class of proteins (6). Fig. 1 reveals that there are 22 com- 
pletely conserved glycines in the SHMT consensus sequence. 
Most of the conserved glycines are located at predicted turns, 
indicating a probable tertiary structural conservation (5). This 
suggests that these glycines are structurally important, being 
located at turns, at positions where helices cross, or at other 
constrained locations. The importance of having a glycine at 
five of these positions is proven by this work. In allele t!576 i 
glycine 372 is changed to arginine; in allele £1670, glycine 143 
is changed to aspartic acid; in alleles tl552 > tX632 t and t!665> 
conserved glycines 204, 406, and 149, respectively, are all 
changed to glutamic acids. Replacing these constrained gly- 
cines with large charged groups probably has a deleterious 
effect on the structure of SHMT. 

Two of the mutations cause alanine to valine alterations, 
t!666 (alanine 103) and s2B18 (alanine 126), These residues are 
conserved alanines in the SHMT consensus sequence, We as- q 
sume there are very strong structural constraints at these § 
positions, as the conservative change of an alanine to a valine §■ 
disrupts the structure enough to affect SHMT activity. A single | 
PCR product could not be obtained for allele t!666 with primer ^ 
set B in the heteroduplex analysis, but the point mutation does |f 
not interfere with the primer binding site (Fig. 3.). To eliminate ^ 
the possibility that t!666 contained a second mutation in one of | 
the B primer binding sites, the overlapping regions, A and C, y 
were sequenced. Neither region contained a mutation, so the % 
failure of this allele to yield a single PCR product with the B <3 
primer set remains a mystery. 

Allele U473 contains a double mutation, as suggested by the v 
heteroduplex analysis where region A and B both indicated the j? 
position of a mutation. Alanine 63 is changed to valine, and g 
leucine 146 is changed to phenylalanine, Residue 63 is an |> 
alanine four out of six times in the SHMT consensus sequence, pi 
but the E. coli protein has an arginine at this position, so we 8 
assume that changing this alanine to valine would have a <o 
minimal steric effect. However, residue 146 is a conserved 
leucine in the SHMT consensus sequence, suggesting that the 
leucine to phenylalanine missense mutation is the primary 
cause of the mutant phenotype in allele tl473. 

In alleles tWl and W20, serine 251 is changed to pheny- 
lalanine. Residue 251 is only three amino acids away from the 
active site and is always small in the SHMT consensus se- 
quence. The replacement of this small amino acid with a large 
aromatic one probably distorts the conformation of the active 
site and may also interfere with the PLP aromatic ring. 

In allele tl607, histidine 259 is changed to tyrosine. The 
mutation in allele tX607 i residue 259 in the SHMT consensus 
sequence, is the only one from our collection that has been 
mutated previously. This histidine is in the conserved active 
site VTTTTHK(S/T) motif found in all SHMTs and is adjacent 
to the active site lysine. Every residue in this motif, except the 
valine, has been mutated in the E. coli isoform of SHMT to 
determine the effects on catalysis (Fig. L). When the active site 
histidine is changed to asparagine, there is no structural 
change in the enzyme, but the catalytic activity is greatly 



SHMT). The alignment was made using the ClustalW program. In the SHMT consensus sequence (consensus), uppercase letters indicate ammo 
acids identical in all six proteins, lowercase letters indicate residues identical in at least four out of six proteins, and a dot indicates conserved ammo 
acids. The numbers indicate the positions in the C. elegans protein. The amino acids mutated in the mel-$2 alleles i are in bold, arid the mutant 
amino acid for each ib listed above the sequence. Residues that have been previously mutated in E. coh are boxed (see text for details). 
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-416 TGCAGTATCGTTTGCACTTGTTTTAA -389 

-390 ATGGTTT CTT TGCATAGTTTT CTGTAGCAATGTT TTGTGTTTGCATTTGAAAGTTAT CTG - 3 31 

-330 CATTAT ATTTTT GTT ATTAGAGACCACTTTTTTGTT TTTAT TACTAAGTTAGTGT ACACT -271 

-270 TGTTAAAACCACGCTCGAATACCTCTTTAATTACAATTTCTAATGAGTAACTTTTGACTT -211 



-210 AT CAGTTTTGATCTACGATCCG GGAT GCGC CCAATCTCGAAAACTTCCCGGTT CAGATAT 
-150 CATCATAGCCTACC GCGAATCCAGTTTTAT T CTTATTTT CTCC&CACTTTCCC ATGCATA 



-151 



-91 



Fig, 3. Nucleotide sequence of the 
C. elegans SHMT genomic region and 
derived amino acid sequence* Intronic 
regions are in lowercase, the primers used 
in the heteroduplex and sequence analy- 
sis are underlined, a putative SL1 leader 
sequence is denoted by a dashed line, and 
the poly(A) signal is double underlined* 
The nucleotides and amino acids mutated 
in the mel-32 alleles are in bold, and 
the allele number for each is listed. This 
sequence corresponds to positions 
39507-37321 (in reverse orientation) of 
the C05D11 GenBanlt entry (accession 
number U00048). 



-90 TACGTGTTGCCAACGCGTCAGCTTTTGTGTTGAAmTGTTCCCGATTTTTTCGCTTCTTT -31 


1 MAD R Q V H T P h 

-30 GCTTAATT TAATTT GGTTTTACAGT GTAAAATGGCT GATCGTCAAGTGCACACACCATTG 

11AKVQRHKYTNNENILVPHVE 
3 1 GCTAAAGTTCAGCGCC^GAAGTACACCAAGftAC^ 

31KVDPEVFDIMKN 

91 AAAGTTGATCCAGAAGtTTXCGATATCATGAAAAATgtaagacttaatgtttctcaaatc 

43 EKKRQRRGLBLXA 
151 ttaabcatttqtttttttcagGAGA AGAAGCGGCAGCGTCGTGGA CTTGAGC'TCATCGCT 

2 

56SENFTSKAVMDALGSAMCNK 
211 TCCGAGAACTTCAC^GGAAGGCJTGTTATGGATGCTC'TTGGCTCGGCAATGTGCAACAAA 



76 Y S E G Y P 



ARYYGGNEFIDQM 



271 TACAGTGAAGGAT ACCCTGGAGCTCGGTATTA CGGAGGAAATGAGTTCATCQATCAGATG 

96 EL&CQKRALEVFGIiDPAKWG 
331 GAGCTCCT TTG CCAGAAGAGAG CTCTTGAGGTATTCGGACTTGATCCAGC CAAGTGGGGA 

11 

116 VKVQ P h 3 G S PAN F A V Y T A I V 
391 GTC^TGTGCAGCCATTGTCCGGATCACCAGCCMTTTCGCAGTCTACACfGCCATCGTT 

ie z M 

136GSNGRIMGLDLPDGGHLTHG 
4 5 1 GGATCCAATGGACGCAT TATGGGCCTCGATCTTCCA GATGGAGGTGATTTGACTCATGGg 

ci 

156 F F T P 

511 taaggttatctagaCtagactcgaatgaattatacatttaatttcagATTCTTCACCCCA 

160 A R K V S A T S £ F F Q 8 I* PYKVDP 
57 1 GCTCG CAAAGTCTCT GCGACTTCTGAATT CTT CCAGT CT CTTCCATAC AAGGTT GATCCA 
52 

180TTGLIDYDKLEQNAMLFRPK 
631 ACAACTGGATT GATCGACTATGACAAGCTT GAGCAGAATGCAAT GCTTT TCCGTCCAAAA 



10 

30 



30 
90 



42 
150 



55 
210 



75 
270 



95 
330 



115 
390 



135 
450 



155 
510 



159 
570 



179 
630 



199 
690 



I 



t 

o 

<a 



I 1 

I 



200 AI IAGVSCYARHIjDYERFRK 219 

691 GCCATCATTGCCGGAGTTTCCTGCTACGCTCGTCATCTCGATTATGAACGTTTCCGTAAG 750 

220 lATKAGAYLMSDMAHI SGIiV 239 

751 ATTGCCACAAAGGCTGGAGCCTATTTGATGTCTGATATGGCTCACATCTCCGGACTTGTT 810 

4, IS » 

240 AAGI^IPSEFEYSDVVTTTTH 259 

811 GCCGCTGGACTTATCCCAT CACCATTCGAGT AT TCTGATGTTGTAACC ACAACGACGOAC 870 

260 KSX»RGPRGALIFYRKGVRST 279 

871 AAGTCGCT CAG AGGACCACGT GGAGCTTTGAT CTTCTACAGAAAGGGTGTCCGATCTACC 930 

280NAKGVDTLYDItEEK INSAVF 299 

931 AACGCAAAGGG AGTTGACAC TTTGTATGATTTGGAGGAGAAGATCAAGTCGGCCGTGTTC 990 

W 31,5 

300 PGLQGGPKNHT1AGXAVALR 319 

991 CCAGGACTTCAAGGTGGACCACACAATCACACTATTGCTGGAATTGCCGTTGCTTTGAGA 1050 

320 QC&SEDFVQYGEQVLKNAKT 339 

1051 CAATGCCTTTCTGAAGATTTTGTTCAGTACGGAGAGCAAGTGTTGAAGAATGCC^UWVCC 1110 
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340 LAERMKKHGYALATGGTDNH 359 

1111 TTGGCCeAG^C^TGAAGAAGCATGGATATGCTTTGGCMCCGGAGGMCCGAC^TCAT 1170 

360 LLLVDLRP IGVEGARAEHVL 379 

1171 TTGTTGCTTGTTGACTYGCGCCCAA TCGGAGTTGAAQGAGCTCG yGCTGAGCATGTGC^y 1230 

Bl 

380DLAHIACNKNTCPGDVSALR 399 

1231 GATCTCGCTCATATTGCTTGCAACAAGAATACGTGCCCAGGAGACGTTTC'TGCTTrGAGA 1290 
n 

400 JPGGI R&GTPALTS RGFQBQ0 419 

1291 CCAGGAGGCATTCGT CT C G0AACTC CAGCTCTCACCTC CCGT GGATTC CAGGAGCAAGAT 1350 

420 P E K V G 0 F I H E 429 

13 SI TTCGAGAAAGTCGGAGATTTCATTCATGAGGgtatgttttttcaactgtaatcttcaaca 1410 

_ fi . 430 GVOIAKKYNAEAG442 

JIG. 4-contmuea uu tt3atc taaaatattgatttcagGTGTTCAAATCGCAAAGAAGTACAATGCTGAGGCTGG 1470 

443 KTLKDFKSFTETNEPTKKDV 462 

1471 AMGACATT GAAAGACTTCAAATCA'TT CACT GAGACCAATGAACCATTCAAGAAAOACGT X 5 3 0 

463 ADLAKRVEEFSTKFEIPGNE 482 

1531 CG CTGATCTCGCCAAACGTGTTGAGGAGTT CTCCACGAAGTT CGAGATT CCAGGAAAC GA 1590 

483 T F * 484 

1591 AACTTTCTAATTTTAATCAT CAT CGTCAT TATCAACATXGCATTTGTATTTATAAAAACA 1 65 0 



1651 TT C CCGCCATTTTATCACTT YCATGTTT GAAT TTTAAAC AGGAGA^MjB^ T ^TTGATCT 1710 
1711 TTTCMATTTMTTGTTTTTTTATATTTAGAMGCAATTGAGATTATCTACGGCATTTGG i 7 7 0 • 



reduced (25). When the active site histidine is changed to 
aspartic acid, the activity of the physiological reactions is re- 
duced, but the activity of some alternate reactions is increased 
(26), A series of kinetic and spectral studies on these mutant 
enzymes revealed that the active site histidine is not catalyti- 
cally essential and is not the base that accepts the a proton. 
These studies have also shown that the active site histidine 
interacts with the amino acid substrate or PLP. This histidine 
is believed to have a critical role in determining reaction spec- 
ificity by determining the structure of the one-carbon binding 
site and controlling the orientation of the substrate and PLP 
ling (25, 26). Because E. coli SHMT is catalytically active 
without an active site histidine, it is unlikely that the imidazole 
ring of histidine 259 participates as an electron donor or accep- 
tor in the physiological reactions of CeSHMT. In allele tX607 t 
inserting the aromatic ring of tyrosine at the active site may 
change the environment of the PLP aromatic ring such that it 
is no longer in line with critical residues required for catalysis. 
The PLP and substrate binding pocket would also be distorted. 

The E. coli active site is VTTTTHKT. Each of the threonine 
residues in this motif was changed to an alanine to determine 
the kinetic and spectral properties of the mutant enzymes (27). 
When the first or fourth threonine is converted to alanine, the 
enzyme is essentially wild-type. When the second or fifth thre- 
onine is changed to alanine, the mutants are structurally un- 
changed but have shifted kinetic properties. These results in- 
dicated that these four threonines do not play a critical role in 
the mechanism of SHMT, However, when the third threonine is 
mutated to an alanine, the enzyme loses 97% of its catalytic 
activity. When this residue is mutated to a serine, the activity 
is essentially wild-type, so the presence of a hydroxyl-contain- 
ing side chain is very important. These studies also revealed 
that none of these threonines bind PLP (27). 

The active site lysine in E, coli has also been mutated. When 
this lysine is changed to a glutamine, the enzyme catalyzes one 
turnover of product at wild-type levels, but cannot release the 
product (28). When the lysine is changed to arginine or histi- 
dine, the PLP cofactor cannot readily form the external aldi- 



mine. These results suggest that the active site lysine expels 
the product by converting the external aldimine to an internal 
aldimine and that lysine is not the base that removes the a 
proton of the substrate. 

It has been proposed that there are two bases at the active 
site of SHMT on opposite sides of the PLP ring (29). These 
bases are not the active site histidine (25, 26, 29) or lysine (28). 
A study on sheep liver SHMT has shown that there are argi- 
nine residues present at the active site, Arginine 269 and 
arginine 462 from sheep liver were protected from chemical 
modification by tetrahydrofolate binding (33). Arginine 462 is 
not conserved but arginine 269 (residue 273 in CeSHMT) is 
conserved as an arginine in all eukaryotic SHMTs and as a 
lysine in E. coli SHMT. Arginine 363 and arginine 372 ixiE. coli 
SHMT (conserved residues 404 and 413 in CeSHMT, Fig. 10 
were changed to both alanine and lysine (30). Both of the 
arginine 372 mutations had wild-type activity, suggesting that 
this residue, although conserved, is not critical for catalytic 
activity. The R363A mutant enzyme had no activity with serine 
as a substrate and could not bind serine or glycine. The R363K 
mutant enzyme had only 0.03% of wild-type activity and a 
16-fold decreased affinity for serine and glycine. The conserved 
arginine at this position is catalytically essential and believed 
to be the binding site of the amino acid substrate carboxyl 
group (30). It has been proposed that an arginine-carboxyl 
interaction might be preferred over simple charge interactions 
because the guanidium group presents charged hydrogen 
bonds rather than the single bond formed by lysine or histidine 
(33), This may explain why replacing an arginine with a lysine 
has such a dramatic effect on enzyme activity. 

The mutant phenotype of allele tl555 is caused by an argi- 
nine (Arg-102) to lysine mutation. The arginine at this position 
is 100% conserved in all sequenced SHMTs, This conservation 
suggests that the presence of the guanidine group of arginine at 
this position is essential to the enzymatic function of SHMT. In 
allele tl597> arginine 84 is changed to glutamine* This arginine 
is also 100% conserved, so the presence of a basic group at this 
position is probably also essential to the enzymatic function of 
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Fig. 4. Exon/intron structure of 
meL82(8BM$) and summary of se- 
quencing results. The overlapping PGR 
products used for heteroduplex analysis 
are shown above the gene. The wild-type 
and mutant amino acid(s) are given for 
each allele. The active site is indicated by 
a sfiaded box. 
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Table II 
meMR2 mu&ml alkies 





Allele 


Codon 
mutation 


Amino acid 
mutation 


1 


tim 


GGA ~> GAA 


Gly-313 -> Glu 


2 


tun 


GCT -* GTT 


Ala-68 -*Val 






CTT-+TTT 


Leu-146 -> Phe 


3 


U474 


GGA -> GAA 


Gly-313 -> Glu 


4 


tmo 


TOT ~» TTT 


Ser-251 -* Phe 


5 


am 


GGA -* GAA 


Gly-204 -> Glu 


6 


tl555 


AGA -> AAA 


Arg-102 -> Lys 


7 


tX576 


GGA-+AGA 


Gly-372 ~> Arg- 


8 


am 


CGG ~> OAG 


Arg-84 -4 Gin 


9 


am 


CAC ~» TAC 


His-259 -> Tyr 


10 


aeie 


GCT -* ACT 


Ala-268 -> Thr 


11 


am 


CAG ~> TAG 


Gln-171 Stop 


12 


U632 


GGA -> GAA 


Gly-406 -> Glu 


13 


am 


GGA -> GAA 


Giy-149 -> Glu 


14 


U666 


GCT -> GTT 


Ala-103~»Val 


15 


am 


TOT -> TTT 


Ser-251 -» Phe 


16 


am 


GGC -> GAC 


Gly-143 -> Asp 


17 


&$18 


GCC ~» GTC 


Ala-126 -> Val 



SHMT. Either or both of these arginine residues could be at the 
active site of the enzyme, 

Chemical modification studies on the cysteine residues in 
rabbit liver cytosolic (31) and mitochondrial (32) SHMT, sheep 
liver cytosolic SHMT (5), and B, coli SHMT (34) have shown 
that there are no disulfide bonds in eukaryotic homotetramera 
or prokaryotic homodimers, and that PLP and substrates pro- 
tect a catalytically essential active site cysteine. There are no 
cysteine mutations in our collection, but residue 99 in the 
SHMT consensus sequence is a conserved cysteine in all the 
eukaryotic isoforms. This residue could be the important active 
site cysteine, 

In a series of experiments to study folding intermediates of 
E. coli SHMT, a set of mutant proteins was constructed in 
which the three tryptophan residues were replaced with phe- 
nylalanine residues (35). The three double mutants and a triple 
mutant were essentially wild-type enzymes, with only minor 
structural differences, Tryptophans 16, 183, and 385 in E, coli 
correspond to positions 37, 214, and 426 in the SHMT consen- 
sus sequence (Fig. 1), These tryptophans are not conserved, 
but, intriguingly, position 37 in the SHMT consensus sequence 
contains five out of six aromatic residues; position 214 has six 
out of six aromatic residues, with all five eukaryotic forms 
having a tyrosine; position 426 also has six out of six aromatic 
residues, tryptophan in E. coli and a mixture of phenylalanine 
and tyrosine in the eukaryotic forms. Clearly, the presence of 
an aromatic side chain at these positions is important. 

It is assumed that each of the mel-32 mutations abolishes, or 



greatly reduces, the SHMT activity, resulting in the observed 
maternal effect lethal phenotype. We hypothesize that a prod- 
uct of SHMT, such as glycine or some byproduct of the one- 
carbon metabolic pool, is required for embryonic development 
in the egg. The essential metabolite is normally supplied by the 
diet but cannot diffuse through the eggshell, mel-32 homozy- 
gotes have enough maternally provided SHMT activity to de- 
velop and hatch into feeding larvae, where the diet can supply 
enough of the required nutrient. These mel-32 homozygotes, 
however, have no SHMT activity to pass on to their offspring, 
which quickly use up the required metabolite pool in the egg 
and arrest as embryos, This hypothesis can also explain why C, 
elegans is unique in its requirement for SHMT activity for 
survival. The SHMT mutations in E. coli, S. cerevisiae, and 
Chinese hamster ovary cells all result in glycine auxotrophy (3, 
4, 9, 10). In all three cases, the deficiency is rescued by addition 
of glycine. The egg shell of C. elegans, which is impervious to 
most chemicals (36), would imprison the embryo in a forced 
starvation. This hypothesis can be tested, and the missing 
metabolite identified, by supplying the developing larvae with 
metabolic precursors, for example glyoine and thymidine. 

The 17 point mutations found in C. elegans meJ-#2(SHMT) 
are clustered within the middle two thirds of the protein, the 
most conserved region, This suggests that the ends of the 
protein are not required for enzymatic function and that any 
point mutations occurring there would not be detected in a 
screen for maternal effect lethals, the likely null phenotype. 
This observation is supported by the fact that if the amino- 
terminal 25-30 amino acids of rabbit liver cytosolic SHMT are 
removed with proteases the enzyme remains catalytically ac- 
tive and structurally stable (37). Sixteen of the mel'32(SBM!T) 
alleles contain a single base pair substitution, and one, tl473, 
contains two base pair substitutions. All of the mutations are 
G/C to A/T transitions, as expected in EMS-induced alleles. 

meU32 has 17 alleles, making it a high frequency hit gene, as 
most Mel genes only have one or two alleles. Severallarge scale 
screens for maternal effect lethals have been carried out which 
have produced hundreds of Mel alleles (38-40). c ' 7 Most of 
these mutations have only been identified genetically, but a few 
have been cloned and sequenced, Most of the identified Mel 
genes are involved in polarizing the embryo or deteimining cell 
fate, For example, par (for partitioning defective) mutant em- 
bryos arrest as amorphous masses of differentiated cells (39). 
par-1 encodes a conserved Ser/Thr kinase, and the two alleles 
that have been sequenced reveal mutations in invariant kinase 
domain residues, suggesting that PAR-1 kinase activity has an 
essential function (41). par-2 encodes a 628-amino acid protein 
with a putative ATP binding site and zinc ring finger domain, 
Two sequenced alleles ofpar-2 introduce stop codons that trun- 
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cate the protein to a form lacking the ATP binding domain (42), 
The pie*l gene encodes a zinc finger protein, which interferes 
with transcription (43). The only sequenced allele of this gene 
contains a 217-bp deletion in the 5' end of the gene (44), The 
mex-X gene contains two copies of the zinc finger domain found 
in pie-1 and mutations in mex-1 alter the fates of some somatic 
blastomeres, Two deletion mutants have been sequenced; one 
deletes the NH 2 -terminal 36 amino acids, and the other deletes 
the COOH-terminal 80 amino acids (45), The mex-3 gene con- 
tains two RNA binding KH domains and regulates blastomere 
identity in embryos. One allele of mex-3 is deleted for the first 
92 bp of coding sequence, whereas three other alleles have 
point mutations that change conserved glycines in the KH 
domain (46). The transcription factor skn~l is required for 
correct specification of cell fates. One mutant form ofskn-l has 
a stop codon introduced in the DNA binding domain (47, 48). 
Finally, the apx-1 gene of C. elegans controls early cell fates 
and is a homolog of the Drosophila Delta and Serrate genes. 
One mutant allele oiapx-1 has a 900-bp deletion in the center 
of the gene (49). 

The extreme degree of conservation in SHMT across millions 
of years of evolution suggests that the conserved residues must 
play some catalytically or structurally essential roles. Several 
of these important residues have previously been identified. 
From our collection of 17 mutations* only one, the active site 
histidine, has been mutated previously. Therefore, this work 
defines 13 new residues (not including the nonsense and two 
repeated mutations) that are potentially essential to SHMTs 
catalytic activity or structural integrity, 

SHMT is believed to have a folding pattern similar to aspar- 
tate aminotransferase and dialkylglycine decarboxylate, both 
of which have crystallographie structure data available (50), 
Three out of four of the residues conserved in all aminotrans- 
ferases are conserved in SHMT. In the CeSHMT numbering 
scheme, these residues are: aspartic acid 231, which hydrogen 
bonds to Nl of PLP; lysine 260, which forms a Schiff base with 
PLP; and arginine 404, which hydrogen bonds with the a-car- 
boxyl group of the substrate. Future crystallographie analysis 
of SHMT may allow characterization of more essential resi- 
dues. The residues corresponding to arginine 404 and lysine 
260 have been mutated in E. coli (Fig. 1) and shown to be 
essential (28, 30), but there are no known mutations affecting 
aspartic acid 231. 

Injecting SHMT subclones containing site-directed muta- 
tions for aspartic acid 231, and other conserved residues, into 
meU2(SHMT) nulls may allow a quick assay of enzymatic 
function, If the mutated subclone rescues, then the mutant 
SHMT is functional; if it does not rescue, then we can assume 
the mutation has abolished or decreased SHMT function, C, 
elegans could also be used to test chemotherapy drugs directed 
against SHMT. If shells of wild-type eggs are solubilized, and 
the embryos treated with drugs, any drug that reduces or 
eliminates SHMT function should cause a Mel phenotype. 
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Rat liver D-3-phosphoglycerate dehydrogenase was purified to 
homogeneity and digested with trypsin, and the sequences of two 
peptides were determined. This sequence information was used 
to screen a rat hepatoma cDNA library. Among 11 positive 
clones, two covered the whole coding sequence. The deduced 
amino acid sequence (533 residues; M r 56493) shared closer 
similarity with Bacillus subiilis 3-phosphoglycerate dehydro- 
genase than with the enzymes from Escherichia coli, Haemophilus 
influenzae and Saccharomyces ceremsiae. In all cases the similarity 
was most apparent in the substrate- and NAD + -binding domains, 
and low or insignificant in the C-terminal domain. A cor- 
responding 2.1 leb mRNA was present in rat tissues including 
kidney, brain and testis, whatever the dietary status, and also in 
livers of animals fed a protein-free, carbohydrate-rich diet, but 



not in livers of control rats, suggesting transcriptional regulation. 
The fuhMength rat 3-phosphoglycerate dehydrogenase was ex- 
pressed in B. coli and purified. The recombinant enzyme and the 
protein purified from liver displayed hyperbolic kinetics with 
respect to 3-phosphoglycerate, NAD + and NADH, hut substrate 
inhibition by 3 -phosphohydroxy pyruvate was observed; this 
inhibition was antagonized by salts. Similar properties, were 
observed with a truncated form of 3-phosphoglycerate dehydro- 
genase lacking the C-terminal domain, indicating that the latter 
is not implicated in substrate inhibition or in salt effects , By 
contrast with the bacterial enzyme, rat 3-phosphoglycerate 
dehydrogenase did not catalyse the reduction of 2-oxoglutarate, 
indicating that this enzyme is not involved in human D- or l- 
hydroxyglutaric aciduria. 



INTRODUCTION 

D-3-Phosphoglycerate dehydrogenase (EC 1,1.1.95) catalyses the 
first step in the pathway of serine formation from glycolytic 
intermediates (reviewed in [1]). This enzyme is widely distributed 
in organisms and in tissues. In rat liver, its activity depends 
strongly on nutritional status, being low in animals fed a normal 
diet and increasing more than 10-fold upon ingestion of a low- 
protein, carbohydrate-rich diet [2,3). When measured in the non- 
physiological direction, the enzyme present in human fibroblasts 
is inhibited by concentrations of the substrate 3-phospho- 
hydroxypyruvate above 10 /iM, and this inhibition is released by 
salts (4), Similar properties have been described for bovine liver 
D-glycerate dehydrogenase, the inhibitory substrate being 
hydroxypyruvate in this case [5,6], but were not observed with 3- 
phosphoglycerate dehydrogenases from bacteria [7] or plants (8). 
In Escherichia coli [9], Bacillus subtilis [7] and plants [8], 3- 
phosphoglycerate dehydrogenase is subject to allosteric control 
by the terminal product of the pathway, serine, whereas such 
control does not appear to be present in the animal enzyme [10). 

Recently, the possible involvement of 3-phosphoglycerate 
dehydrogenase in the pathogenesis of two newly described 
hereditary disorders has attracted our attention. The activity of 
this enzyme was indeed markedly decreased (to 13 and 22 % of 
the normal value) in fibroblasts from two patients with a 
decreased concentration of serine in the plasma and cerebrospinal 
fluid [4], Furthermore, £. coli 3-phosphoglycerate dehydrogenase 
was shown to catalyse the reduction of 2-oxoglutarate to both 
the P- and L-isomers of 2-hydroxygIutarate, suggesting that a 
mutation of the human enzyme may contribute to the neuro- 
metabolic diseases d- and L-hydroxyglutaric aciduria [H]. 



The primary sequences of 3-phosphoglycerate dehydrogenases 
from B. coli{\2) } Haemophilus influenzae (GenBank accession no. 
L45106; [13)), Saccharomyces cerevisiae (GenBank P40054) and 
Bacillus subtilis (GenBank L47648; [14]) are known, as is the 
three-dimensional structure of the E. coli enzyme [15). Each 
subunit of the tetramcric protein has three distinct domains: a 
nucleotide-binding domain (residues 108-294), a substrate-bind- 
ing domain (residues 7-107 and 295-336) and a regulatory 
domain, which binds L-serine (residues 337-410). The main 
contact points between the subunits are at the level of the 
coenzyme-binding domains and the regulatory domains, indi- 
cating the importance of these zones for the tetramerization of 
the enzyme. 

The aim of the present work was to initiate an investigation of 
the mammalian enzyme at the molecular level, and to study its 
kinetics and specificity, particularly its ability to catalyse the 
reduction of 2-oxogIutarate to 2-hydroxyglutarate, and the 
mechanism by which its activity is controlled by diet. 

MATERIALS AND METHODS 
Materials 

3-Phosphoglycerate, NAD\ NADH, the random prime DNA 
labelling kit and Tag and Pwo DNA polymerases were from 
Boehringer Mannheim. 3-Phosphohydroxypyruvate was pre- 
pared from the tricyclohexylammonium salt of the dimethyl ketal 
derivative [16] purchased from Sigma. [#- a *P]dCTP (approx. 
3000 Ci/mmol), [a- H S]dATP (> 1000 a/mmol), the A Taq 
Cycle Sequencing kit (USB) and Hybond N* membranes were 
from Amersham International. The T7 Sequencing® kit was 
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from Pharmacia Biotech Inc., and the IRD41 dye-labelled M13 
reverse primer was from LI-COR. 

Enzyme purification from rat liver and amino acid sequencing of 
two tryptic peptides 

Male Wistar rats were fed on cornflour for 2 weeks before being 
killed to increase the expression of the liver enzyme [2], The livers 
of 14 rats . were homogenized in 4 vol. of 0.25 M sucrose 
containing 20 mM Hepes, pH7.5, 100 mM KC1, 1 mM dithio- 
threitol and 0.25 mM PMSF. The homogenate was spun for 
30 min at 9000 $ and 4 °C. A 7-22% (w/v) polyethylene glycol) 
8000 fraction was prepared from this extract and dissolved in 
200 ml of 10 mM Hepes, pH 7.5, 1 mM dithiothreitoi and 1 mM 
EDTA (buffer A) supplemented with 0,25 mM PMSF, The 
preparation was clarified by centrifugation and loaded on to a 
DEAE-Sepharose column (2.5 cm x 22 cm; flow rate 2.5ml/ 
min)« The column was washed with 100 ml of buffer A containing 
0,25 mM PMSF and eluted with a linear salt gradient (0-400 mM 
KC1 in 400 ml of buffer A containing 0.25 mM PMSF, 1 /tg/ml 
leupeptin and 1 //g/ml antipain). The enzyme was then diluted 
13-fold in buffer A and loaded on to an AMP-Sepharose column 
(I cm x 3 cm) equilibrated with buffer A, The enzyme was eluted 
from the affinity column with 0.125 mM NAD" 1 and 0.125 mM 3- 
phosphoglycerate in buffer A. In order to obtain internal tryptic 
peptides, the band corresponding to 3-phosphoglycerate de- 
hydrogenase in the purified preparation was cut from a Coo- 
massie Blue-stained SDS/polyacrylamide gel and the protein 
was concentrated in agarose for trypsin digestion [17]. Peptides 
were purified by narrow-bore HPLC for microsequencing [17]. 

Enzyme assays 

3-Phosphoglycerate dehydrogenase was assayed spectrophoto- 
metrically at 30 °C in a mixture comprising 25 mM Hepes, 
pH7.1, 90 iM phosphohydroxypyruvate, 90 /*M NADH and 
400 mM KCl, unless otherwise indicated. One unit is the amount 
of enzyme catalysing the reduction of 1 ^mol of phospho- 
hydroxypyruvate/min under these assay conditions. For the 
determination of kinetic constants in the physiological direction, 
activity was measured in a mixture containing 200 mM Tris, pH 
9, 180 mM hydrazine sulphate, 1 mM EDTA and the indicated 
concentrations of 3-phosphoglycerate and NAD + . 

Amplication of a cDNA fragment by PCR with degenerate 
oligonucleotides 

A library constructed from rat FT02B hepatoma cells in IM- 
Zap® XR (Stratagene), with oItgo(dT)-primed cDNAs inserted 
between EcdKl and Xltol restriction sites, was kindly provided 
by V. J. Dupriez (I.C.P. Brussels) [18). The DNA purified from 
this library was used as a template to amplify a cDN A fragment 
with Tag DNA polymerase. The sense primer [5 / -GC(C/T)G- 
G(A/C)AC(A/C)GG(A/C)GT(G/C)GA(C/T)AA(C/T)GT- 
(G/C)GA-3'] corresponded to the first nine amino acids 
(AGTGVDNVD) of one of the sequenced peptides (see Figure 
1); the antisense primer was a 20-mer corresponding to the 
vector-specific T7 promoter. The amplified product (1450 bp) 
was purified, cloned in a T vector prepared from pBluescript (19] 
and sequenced. 

cDNA cloning and sequencing 

About 300000 plaques from the cDNA library described above 
were plated and lifted on to Hybond N® membranes. The DNA 
was cross-linked to the dried filters by UV irradiation. Prehybriz- 
ation for 1 h and hybridization for 15 h were performed at 65 °C 



in a solution containing 6x SSC (1 x SSC is 0.15 M NaCl, 
0.015 M sodium citrate, pH 7.0), 5 x Denhardt's solution [20], 
0.5% SDS and 20/ig/ml denatured herring sperm DNA. An 
approx. 1000 bp Xhol restriction fragment corresponding to the 
y end of the cDNA was radiolabeled with [«- M P]dCTP by 
random priming [20] and used as a probe. The membranes were 
washed at 65 °C (2x SSC, 3x10 min; 2x SSC/0.1% SDS, 
J x 10 min; 2 x SSC, I x 10 min; 0,2 x SSC, 1 x 10 min), dried 
and exposed to Kodak XAR-5 film at room temperature for 
2-18 h. 

Recombinant pBluescript cDNAs were excised in vivo from 
the lambda Uni-ZAP® XR vector as recommended by the 
manufacturer and purified by the alkaline lysis method [20]. The 
plasmid containing the longest insert was sequenced completely 
in both directions by the dideoxy method [21] with the T7 
Sequencing^ kit, [a- 3 *$]dATP and specific primers from within 
the cDNA sequence and from flanking regions of the vector. In 
some cases, the A Taq Cycle Sequencing kit^ and the IR-dye~ 
labelled M13 reverse primer were used and the products were 
analysed using an automated laser fluorescence DNA sequencer 
4000L from LI-COR. Sequences were analysed using the DNA 
Strider programme [22]. Multiple sequence alignment was per- 
formed using the GenBank program PILEUP. 



Northern blots 

Total RN A was isolated from various tissues by the guanidimum 
isothiocyanate/CsCI procedure, subjected to electrophoresis in 
1 % agarose/formaldehyde gels and transferred by capillarity to 
nylon membranes [20]. These were hybridized and washed as 
described above. 



Expression and purification of recombinant 3-phosphonJycerate 
dehydrogenase 

The coding region of the full-length cDNA clone of 3-phospho- 
glycerate dehydrogenase was amplified by PCR using Pwo DNA 
polymerase, a low-error enzyme, with one primer (GGAATTC- 
CA TATG GCCTTCGCAAATCTG) containing the start codon 
(underlined) inserted in a Ndel site, and a second primer 
(CGGGATCCTTCAGAAGCAG AACTGGAA) containing the 
stop codon (underlined) flanked by a BamHl site. The amplified 
fragment was cloned into pBluescript restricted with EcoKV, 
excised from this vector with Ndel and BatnHl and inserted into 
the expression vector pET3a (23] to produce the recombinant 
plasmid pET3PGDH. For the preparation of the shortened form 
of the enzyme, the second primer used in the amplification 
(GGTGTGGGATCCTCAGGCACTGGTAAG) was designed 
to replace Phe-325 by a stop codon (underlined). To produce the 
recombinant enzymes, BL21(DE3)p£y.?S cells transformed with 
the expression plasmids were aerobically grown in 1 litre of LB 
medium containing 0.2% glucose and 50/ig/ml ampicillin at 
37 °C up to an A WQ of 0.6 unit. The culture was then quenched 
for 30 min on ice before addition of 0,4 mM isopropyl /9-d- 
thiogalactoside to induce expression of the cloned enzyme. 
Culture was continued for another 18 h at 18 °C. Cells were 
harvested by centrifugation (12000 g, 30 min, 4 °C) and extracted 
as described in [24]. The recombinant enzyme was purified by 
chromatography on DEAE-Sepharose and AMP-Sepharose, as 
described for the native liver enzyme. The active fractions were 
diluted with 1 vol. of glycerol and stored at -20 °C, under which 
conditions the enzyme was stable for several months. 

Protein was determined by the procedure of Bradford [25], 
with bovine y-globulin as a standard. 
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RESULTS AND DISCUSSION 

Purification of rat liver 3-phosphoglycerale dehydrogenase and 
screening of the cDNA library 

Rat liver 3-phosphoglycerate dehydrogenase was purified to 
homogeneity from the livers of rats that had been fed for 2 weeks 
with a protein-free, high-carbohydrate diet, by a procedure 
derived from that used by Lund et al. [26), The purified 
preparation had a specific activity of 35 units/mg of protein at 
30 *C and contained a single polypeptide chain of M t 57000 in 
SDS/polyacrylamide gels (results not shown). The protein was 
concentrated and digested with trypsin. Tryptic peptides were 
purified, two of which gave the following sequences; AGTGVD- 
N VDLEAATR (peptide 1) and ALESGECAGAALDVFTEEPP 
(peptide 2). PCR-amplification with a primer derived from 
peptide 1, a primer corresponding to the T7 promoter and DNA 
from a hepatoma cDNA library yielded an approx. 1450 bp 
fragment, which was cloned and found by sequencing to encode 
a protein homologous to E, coli 3-phosphoglycerate dehydro- 
genase. 

This ~ 1450 bp fragment was used to screen the hepatoma 
cDNA library. The 1 1 positive clones that were obtained had 
inserts of between 0.63 and 1.78 kb that were differently truncated 
at their 5' ends. The largest clone was sequenced over its entire 
length on both strands. 

Nucleotide sequence and deduced amino acid sequence 

The nucleotide and predicted amino acid sequences are shown in 
Figure 1. The proposed ATG start codon lies within a suitable 
consensus sequence for translation initiation by eukaryotic 
ribosomes [27]. It is preceded by an in-frame TAA stop codon 
and opens a reading frame encoding a protein of 533 amino acids 
with a predicted M t of 56493 and a pi of 6.26. These values are 
in agreement with the behaviour of the liver enzyme in SDS/ 
PAGE (57000 Af r ) and in chromatography on ion exchangers. 
The enzyme is indeed retained by DEAE-Sepharose at pH 7.5 
and by SP-Sepharose at pH 6.0 (results not shown). The next 
ATG codon is situated about 285 bp downstream. Its use would 
result in a peptide with a much lower M r than observed (approx, 
45000). The open reading frame is preceded by 68 bp of 
untranslated region and followed by a 3'-untranslated sequence 
of 11 3 bp containing an 18 bp poly(A) tail. The putative, 
somewhat atypical, poly(A) addition signal AGTAAA starts 
18 bp upstream of the poly(A) tail. 

The amino acid sequences of the two peptides resulting from 
digestion with trypsin (Figure 1) match the sequence of the 
recombinant enzyme, with the exception of two glutamine 
residues that were detected as glutamic acid during sequencing of 
the second peptide, presumably as a result of deamidation. These 
data indicate that the cloned cDNA encodes the subunit of the 
liver D-3-phosphoglycerate dehydrogenase. Partial sequences 
obtained from the other, incomplete, cDNA clones were con- 
sistent with that shown in Figure 1 , indicating that the enzyme is 
expressed as a single molecular form in FT02B cells, 

Comparison with related enzymes from other species 

The predicted sequence of the rat liver enzyme has been aligned 
with 3-phosphoglycerate dehydrogenases from E, coli, H. iitflu- 
enzae, S. cereoisme and B. subtilis (12-14] (Figure 2). The Figure 
indicates the positions of different domains in the £. coli enzyme, 
as well as amino acids implicated in catalysis. GenBank also 
contains the sequences of mouse and human partial cDNAs 
encoding peptides displaying > 90% identity with rat liver 3- 
phosphoglycerate dehydrogenase (not shown). 



GCOITCAGXXTCCTOTACTAAGa-GCTTCTCCCCACCAGAGCAACCGATtCrAWGCCTGG $0 
CTCTAGCAAlCGCCTICGCAAATCTGCGCAAAATACT<^TCAGTGATfl(5CCTCGACCCCT 120 

MAfXHLRKJLlStJSLDP 17 
GCTGCCGGAAGAtCCIGCXAGW-GGAGGGCXGChGGlGGTGGAGAAGCAGAACZXGAGCA 1 SO 

CCRKXLQDGGLOVVBKONLS 37 
AGGRGGRGCTGAl'AGCCGMCTCCKGGACXGIGfeAGGCC'ETXTCGTCCGGTCAGCTACTA 240 
KJBBLIAEL0DCR6I.XVRSAT 57 
AGGlCAClGCtGAtGXCATCAATGCAGCAGAGAAGCTCCAGGTGGTGGGCAGGQCTGGTA 300 
KVTADVINAAEKLQVVGR , ft ft, 77 
CAGGCGtGGACAATGXGGATCTGGAGGCTGCCACAAGGAAGGGCGTCCTCGTCAXGAACA 3 $0 

*ft G V T) M v r> 7, F. A A T R X G V & V M V 97 
CCCCCAATGGAAATAGCCTCAGTGCXGCGGAACXCACCTGTGGGATGCTCATGTGCCXGG 420 
TFHGNSfeSAAELTCGMLMCL U7 
CCAGGCAGATCCCCChGGCGACGGCTTCGATGAAAGAlrGGCAAATGGGACCGGAAGAAGT 480 
ARQIPOAXASMKDGKtfDftXK 137 
TCATGGGGACAGAGCTGAACGGGAAGACACTGGGAATXCTTGGCC tGCGCAGAATXG GAA 540 
FMGTELNGKXLGX LGLGRI G 157 
GAGAGGTGGCCGCCCGAAX'GCAGGCCXXTGGAATGAAGACTGTAGGCXATGACCCCAICA 600 
REVAARMQAFGHXXV6YDPI 177 
XTrCTCCAGAAGTCGCTGCCTCCTTXGGTGTTCAGCAGCTGCCGCTAGAGGAGAXCXGGC 660 
I8PBVAASFGV001'P1'£EXH 197 
CXCXCTGXGAXTTCAXCACTGTCCAXACCCCGCTCCXGCCCTCCACTACAGGCXXGCXCA ' 720 
rtCfiPXTVHTPLL»£T10LL 217 
ATGACAGCACCXXXGCCCAGTGCAAGAAAGGCGTGCGGGXGGXGAACTGXGCTCGAGGAG 780 
NPSTFAQCKKGVftVVNCARG 237 
GCAXXGXGGAXGAAGGIGCCCXGCXCCGXGCCCXGCAGTCXGGXCAGXGTGCXGGXGCXG 6 40 

GXVPEGALLR , A L 0 S , 6 ,0,,C ft ff, , 257 
CACTGGATGTGXrTACAG AAGAGCCACCACGGGACCGGGCCTTAGTGGACCACGAGAACG & 00 

ALPvrTttfiPPR DRA1V&KBB 277 
TCAXCAGCXGXCCCCACCXGGGCGCCAGCACCAAGGAGGCCCAGAGCCGCXGXGGGGAGG 960 
VISCPHLGASXKEAQSRCGE 297 
AAATCGCAGICCAGIXXGXGGACAXGGXGAAGGGGAAAXCXCXAACAGG6GXTGTAAACG 1020 
EIAVQFVPNVKGKSXTGVVN 317 
CCCAGGCICIXACCAGTGCCrxCXCXCCACACRCCAAGCCXXGGKXXGGXCXGGCAG AAG 1 080 
AQALTSAFSPHTRPWIGlAfi 337 
CATTGGGCACGCTGAXGCACGCCTOGGCTGGCTCCCCrAAAGGGACCAXCCAGGTGGTGA 1140 
AfcGTLMHAWAGSPXGfXQVV 357 
CACAAGGAACAtCXCXGAAGAAXGCXGGGACCXGCCXGAGCCCXGCGGXCAXXGXCGGCC 1200 

TQQTSLKNAGTCLSPAVfVG 377 
TXCIGAGAGAAGCAXCAAAACAGGCAGATGXGAACTXGGIGAACGCXAAGCXACXGGXGA 1260 
IfcRBASKQAOVMLVHAXblV 397 
AAGAG6CIGGCCXCAATGTCACCACCXCCCACAGTCCXfl6XGXCCCAGGAGAGCAG6GCA 1320 
XEAGfcNVXTSHSPGVPGEOG 417 
XCGGGGAATGCCXCCXGACXGXGGCCXXGGC AGGTGCCCCCTACCAAGCTGXGGGCXXGG 1380 
1GEC1LTVALAGAPVQAVGL 437 
XCCAGGGCACCACACCAAXGTXGCAGAXGCTCAACGGAGCXGXCXXCAGGCCAGAGGXGC 144 0 

VQGTffPMXiQHLNGAVFRPEV 457 
CXCXACGCAGGGGCCAGCCCCXGCXCCXGXICCGGGCICAGCCCXCCGACCCIGXCAXGC 1500 
PLARGQP&J.fcPRAQPSDPVM 477 
TGCCCACXATGATCGGGCXACtGGCAGAGGCGGGGGTACAGCXGCXGXCCXACCAGACCI 15 60 
&PJMIGLLABAGV01»L$rQT W 
CCAAOGTGXCXGACGGAGACACTXGGCACGXCATGGGCCICXCCXCCCTACXGCCCAGCC 1620 
SXVSDGDtHHVMGlSSLfcPS 517 
IGGACGCAXGGAAGCftGCAXGXXXCTGAGGCTXXCCAGTTCXGCXICISACCCAGGGGCI 1680 
LOAHKOHVSEAFOFCF 533 
CAGCGGXCCCAGCCCCXGAGGCXCXTCXGAGGAAACCQQGXCAGXGTGAGGlGAACtAAT 1740 
AXCXAGiAAAGAATXXMCXCCAAAAAAAAAAAAAAAAAA 1780 



Figure 1 Sequence of the cDNA encoding rat hepatoma 3-phosphoglycerate 
dehydrogenase and Its deduced amino acid sequence 

The Initiator ATG codon, the stop codon and (he putative polyadenyiation signal w underlined. 
The in-rreme TAA stop codon upstream oi the Mel initiator codon is shown in italics. The amino 
acid sequences corresponding to the two tryptic peptides are underlined. Note that the two 
predicted glutamine residues of the second peptide were recovered as glutamate by amino acid 
sequencing. 



Remarkably, the rat enzyme shows closer similarity to its B> 
subtilis counterpart than to the three other enzymes. The rat and 
B. subtilis 3-phosphoglycerate dehydrogenases are of about the 
same length, which is about 1 20 residues longer than either the 
E. coll or H, influenzae enzymes. This difference is mainly due to 
the presence of a longer C-terminal domain 200 residues, as 
compared with ~ 75 residues). Furthermore, the nucleotide- and 
substrate-binding domains of the rat enzyme show more identity 
with the B, subtilis enzyme (46 %) than with the other enzymes 
(32%). For the E> coli/S. cerevisiae pair, the corresponding value 
is even higher (53 %), indicating that the yeast enzyme is closer 
to the E. coli enzyme than to the mammalian enzyme. The 
percentage identity is much lower in the regulatory domain than 
in the other domains, and is 15% for the rut/B. subtilis pair, 
30 % for the E. coli/S. cerevisiae pair and not significant when 
the rat enzyme is compared with that from E, colL These 
observations suggest that there are two different types of 3- 
phosphoglycerate dehydrogenase that originated from a dupli- 
cation event occurring before the separation of eukaryotes and 
prokaryotes. 

Figure 2 also shows that the consensus sequence Gly-Xaa-Gly- 
Xaa-Xaa-Gly-Xaa n -Asp (Figure 2), involved in binding the 
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BCOl , , . . , > , . .MAKV . .SLSKDKIK FLLVEGVKQK ALES&RAAGY TNIEFJJKGAL DDEQLKESIR 52 

Binf , MTNKV . .SLDKSKIK FVLFEGVHQS ALDTLHAAGY TNIDYYKKAL DGDELKEAIK 53 

Seer MTSXDINNLO. NTFQQANNMS GSPGAVCTSP TQSFMNTVPO RLNAVKBPKX LKPFSXGDMK ILLLENVNQT AXTIFEEQGY , QVEFYKSS1* PEBELIEKIK 99 

Bsub «FK VLVSDKMSND GLOPLIESDF IEIV. .QKNV ADAE, .DEM 39 

Kat MAFANLRK ILXSDS&DPC CRKILQDGGI* QWE..RQNL SKEELIAEX.0 46 

Substrate binding domain 1 

** * * 

EOOl PABFIGIjRSR TBl/EEDVINA AEKXiVAIGCf CIGTNQVDLD AAAKRGIPVF NAPFSHTRSV AKLVIGELLL LLRGVPEANA KAHRGVWHKi. AAGSFEAHGK 152 

Hinf DVHFIGL&SR SBI/TAEMIEA APKLIAVGCF CIGTN&VDU* AAKARGIBVF NAPFSNXRSV AELVLGBXLL LMRNVPQAHA EVHRGVWWKS ATGSflEVRGK 153 

Seer DVHAIGIRSK XRLXSNVt^H AKNXVCIGCF CIGTNQVDLD YATSRGIAVF NSPFSNSRSV AEIWAEIIS LARQ1GDRS1 ELKTGTWNKV AARCWEVRGK 199 

Bsub XFDALLVRSA SKVtfEDLFNK MtSLKIVGrA GVGVDNXDID EATKHGVIVX NAPNGNIIST AEHTFAHISS LmRKIpGaMX SVKSREWNRT AYVGSelYGK 139 

Rat DCEGLIVRSA TKVTADVIKA AEKLQWGRA GTGVDNVDLE AATRKGVLVM JWPNGNSLBA AELTCGMLMC LARQIPQATA SWKDGKWDRK KFHGTEkNGK 146 



Nucleotide binding domairu 



If 

Bcol JOGXIGYGHX GTQkGXLAES LGMYVYFYD , , IENKLPLGN ATQVQHX/SDIj IMSDWStH VPENPSTKNM MGAKBXSLMX PGSLLIHASR GTWDIPALC 250 

Kinf KLGIIGYGHI GSQLSIIAES LGMDVYFYD, . IENKLPLGN AKQVRSLEEL LSSCDWSLH VPELPSTKNL MNVARlAQLX QQAILINAAR GTWD1DALA 251 

Seer TLGlIGYGHX GSQLSVUBA MGLHVLYYD, . XVTIMALGT ARQVSTLDEL LNKSDFV21H VPATPETEKH LSAPQFAAMK DGAYVIMASR GWDXPSlI 297 

Beub TLGIVGLGRX GSEIAQRRGA FGMTVHVFDP FLTEERAKKI GVMSRTFEEV LBSADIITVH TPLTKETKGL INKETIAKTK KGVRIrXNCAR GGHDEAAH* 239 

Rat TLGILGLGRX GREVAARMQA FGHKTVGYDp IISPEVAASF GVQQLPLEEI WPLCDFITVH TPLLPSTTGL LNDSTFAQCK KGVRWNCAR GGJVDEGAlb 246 

.,,. ,„, ,„, , , , „, , , . ■,„, , ■.„ ,, ■,„!. .. . fii^^rfti-s binding domain 



Ecol DAIASKHLAG AAIDVFPTEP ATKSDP FTSPLC EFDNVLWPH IGGSTQEAQE HIGLBVAGKL XKYSDNGSTL SAVWFPEV. . ,,SLP,,..L 334 

Hinf QALKDGKLQQ AAIDVFPVEP ASINEE FISPI»R EFDNVILTPH IGGSTAEAQE NIGFEVAGKF VKYSDNGSTL SSVNFPEV, , ,,SfcP....E 335 

Scfcr QAVKANKXAG AAXDVYPHEp AKNGEGSFND ELNSWTSELV SliPNIILTPH IGGSTEEAQS SXGIEVATAX, SKYIHEGHSV GSVNFPEV.. . .AlKSLSYD 393 

Bsub EALENGHVAG AALDVFEVEP PVDN KXV DHPLVlATPH 1GASTKEAQL NVAAQVSEEV LOFAKGtPVH SAIKLPAM'JK DEFAKlKPYH 326 

Rat RA1QSGQCAG AALDVFTEEP PRDR, . , , ALV DHENVISCPH LGASTKEAQS RCGEEIAVOF VDMVKGKSX/f GWNAQAtTS AFSPHTKPWI 333 

1 Regulatory domain J 

BCOl HGG.RRLMHI HENRPGVLTA LHKIFAEQGV NIAAQYIiQTS AQMGYV.VID IEA.DEDVAE KALQAMKAIP GTIRARLLY 409 

Hinf HEGTKRLLHI HENRPGXLNK X,WQIFVBAWD NIAAQYLQ1D PKXGYV.WD VBT.ND. .AS PLliTKtKEXD GtlRARVLY 410 

Scet QENTVRVX»YI HQNVPGVLKT VNDILSNH. , NIEKQFSDSN GBIAYL . MAD ISSVDQSDIK DIYEQLHQTS AKISIRLLY 469 

Bsub QIAGKIGSLV SQCMKEPVQD VAXQYEGTIA KLETSFITKA LIvSGFLK-PR VDSTVNEVHA GGVAKERGIS FSEKISSSES GYDNClSVKV TGDRSTFTVT 425 

Rat GlAEALGTLM HAWAGSPKGT IQWT06TSL KNAGXCt,SPA VlVGLtREAS KOADVNLVHA KLLVKEAGLN VTTSHSPGVP GBQGXGECIit. TVALAGAPYO 433 

Bsub AXYIPHFGER XVEINGFNID FYPTGHLVYI OHQDXTGVXG RVGRILGDND INXATHOVGR KEKGGEAXHM tSFDRHLE.D KIVKELXNVP DIVSVKLIDLp 525 

Rat AVGliVQGXTP HMWLNGA.V FRPEVPLRRG QPLIiLFRAQP SDPVMiPTHl Gl/LAEAGVQL L$YQTSKV$D GDTHHVMGL3 SLJ-PSI-DAHK QHVSBAFQFCF 533 



Figure 2 AHgnment of the sequoncas of 3-phosphoplycerate dehydrogenase from various species 

The amino acid sequences of the enzyme from £ co!i (Ecol; (12)), H. influenzae (Hinf; GenBank accession no. L45106; (13)). S. cercv!slae{$w: GenBank P40054), B, sums (Bsub: GenBank 
147648; [14]) and rat hepatoma (the present work) are displayed In singie-lelier code alter alignment for maximal ideniiiy. Gaps are indicated by dols and slrictly conserved amino acids are in 
bold. Asterisks indicate residues surrounding Ihe catalytic pocket, where hydride transter occurs. Residues proposed to be involved In proton shuttling ( h-) and the argintne presumably Involved In binding 
of ihe carboxylale group of the substrate (Y) (15) are also shown. The domains ol the £ wil enzyme are shown above Ihe alignment 



adenosine portion of NAD + [28], is located in the rat enzyme at 
residues 152-175* Several residues lining the catalytic pocket in 
the B. coli enzyme are strictly conserved in the mammalian 
enzyme (Arg-60, Ser-61, Asn-108 and Gln-301) or replaced by 
functionally similar residues (Lys-141 replaced by Arg-136). By 
analogy with the B, coli enzyme, it is likely that Arg-236 binds the 
carboxy group of the substrate and that His-292 and Glu-269 
function as a proton shuttle [15], 

Northern blots 

Northern blot analysis of total RNA indicated the presence of a 
specific ~ 2.1 kb mRNA in livers derived from animals main- 
tained for 3 days on a protein-free, high-carbohydrate diet 
(Figure 3). The length of this mRNA is consistent with the size 
of the longest clone that we obtained, allowing for the presence 
of the poly(A) tail. Strikingly, the mRNA was not detectable in 
the livers of animals fed on a normal diet, suggesting that control 
of the expression of 3-phosphoglycerate dehydrogenase is exerted 
at the level of transcription or by stabilization of the mRNA. A 
2.1 kb mRNA was also observed in kidney (Figure 3), testis and 
brain (results not shown), and in lower amounts in lung (Figure 
3), skeletal muscle and heart (results not shown). In these tissues, 
no effect of dietary manipulation on 3-phosphoglycerate de- 



hydrogenase mRNA was observed. These results are in agreement 
with the observation that the induction of the enzyme by a low- 
protein diet is restricted to the liver [29], 



Expression of recombinant rat 3-phosphoglycerate deydrogenase 

The isolated cDNA was expressed in E> coli to verify that it 
encoded a protein with similar properties to the enzyme purified 
from liver, Addition of isopropyl thiogalactoside to BL21- 
(DE)pLys§ cells carrying the expression plasmid resulted in the 
induction of 3-phosphoglycerate dehydrogenase, the activity of 
which reached about 8 units/mg of protein after 18 h at 18 °C (as 
compared with 0.01 unit/mg of protein in control cells). The 
recombinant protein was purified by chromatography on DEAE- 
Sepharose, and was nearly homogeneous after this single step, 
corresponding to a polypeptide with the expected size (M T 57000 
by SDS/PAGE; results not shown). After AMP-Sepharose 
chromatography, the preparation had a specific activity of 35 
units/mg of protein. From 1 litre of culture, 4700 units of 
purified enzyme was obtained, corresponding to a 50% yield. 
Upon gel nitration on Sephacryl $-200, the enzyme was eluted 
with an M r of approx. 220000 (results not shown), indicating that 
it behaves as a tetramer, like the enzyme from E. coli [30], rabbit 
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Figure 3 Northern blot analysis of mRNA from liver, kidney and lung from 
rats fed either a normal diet or a protein-tree, carbohydrate-rich diet 

Rats either were fed laboratory pellets (Cont.) or were maintained lor 3 days on a protein-free, 
high-carbohydrate die! (PF-HC). Total RNA was extracted and subjected to electrophoresis 
(15 ^j/lane). for each condition, samples from two different animals are shown, 




(3- Mtydroxy pyruvate] (nM) 

Figure 4 Effects o( 3-phospbohydroxypyruvate and KCI on the activity ot 
the recombinant full-length (A) and truncated (B) forms of liver 3- 
phosphoglycerate dehydrogenase 

The enzyme was assayed with the indicated concentrations of pho spho hydroxy pyr uvat e 
hydroxypyruvate) and KCI, In the presence of 90 /*M NAOH, 



liver [26] and chicken liver [31]. The recombinant protein purified 
on DEAE-Sepharose was used in the kinetic studies. 

Kinetic properties 

The kinetic properties of the recombinant and native enzymes 
were investigated. When measured in the physiological direction, 
both enzymes showed K m values of about 100 /M for 3- 
phosphoglycerate and 27 /iM for NAD + . In the opposite di- 
rection, the enzyme displayed a K m of 20-25 /*M for NADH, and 
was inhibited by elevated concentrations of 3-phosphohydroxy- 
pyruvate (Figure 4A). As previously reported by Jaeken ct al. [4], 



this inhibition was released by 100-400 mM KCI. The data 
obtained in the present study at low substrate concentrations 
indicate that the salt displaced the saturation curve to the right, 
inhibiting enzyme activity at low concentrations of substrate and 
stimulating activity at elevated concentrations. Similar effects 
were obtained with other salts. When measured with 90 /M 3- 
phosphohydroxypyruvate, the effect was biphasic, with an op- 
timum at a concentration close to 400 mM for univalent salts 
and 80 mM for bivalent salts (results not shown). 

We have also tested the effects of serine, which is known to 
inhibit the activity of the 3-phosphoglycerate dehydrogenases 
from B. coli {% B> subiilis [7] and plants [8], and of the other 19 
standard amino acids at concentrations up to 5mM } on the 
activity both in the physiological direction and in the reverse 
direction; none of these amino acids affected the enzymic activity. 
Since the enzyme from rat appears to be closer to the B. subtilis 
enzyme than to the other enzymes, we took the precaution of 
testing the sensitivity to serine under conditions similar to those 
described by Saski and Fizer [7], i.e. by preincubating the enzyme 
with serine in the presence of dithiothreitol. No effect was 
observed under these conditions. 

Properties of a truncated form of ral 3-phosphogIycerale 
dehydrogenase 

Because the inhibition by excess substrate appeared to be specific 
for the mammalian enzyme, we tested the hypothesis that the C- 
terminal domain mediates this inhibition. We therefore expressed 
in E. coll a truncated form of 3-phosphoglycerate dehydrogenase 
lacking the last 209 amino acids. The recombinant protein was 
purified to a specific activity of 14 units /mg of protein. As 
expected, its subunit migrated with an Af r of 36000 in SDS/PAGE 
(results not shown). Gel filtration on Sephacryl S-200 gave an M t 
of 80000, indicating that the enzyme behaved as a dimer and 
confirming the role of the C-terminal domain in the tetra- 
merization of the enzyme [15], However, the mutant enzyme 
displayed substrate inhibition by phosphohydroxypyruvate, and 
this inhibition was sensitive to salt (Figure 4B). These results 
indicate that the C-terminal domain of the rat enzyme is not 
involved in the inhibition by excess phosphohydroxypyruvate. 

Lack of 2-oxoglutarate reductase activity 

Since E. coli 3-phosphoglycerate dehydrogenase has been re- 
ported to catalyse the reduction of 2-oxoglutarate to L- and D- 
hydroxyglutarate [1 1], we tested whether the rat enzyme could 
catalyse a similar reaction. However, at concentrations between 
10 juM and 10 mM, 2-oxoglutarate was not a substrate for rat 3- 
phosphoglycerate dehydrogenase, whether 400 mM NaCl was 
present or not. Furthermore, at 10 mM, 2-oxoglutarate did not 
behave as an inhibitor of the enzyme in either the forward or 
reverse direction. Similar results were observed with the truncated 
enzyme. These results therefore argue against the possibility that 
3-phosphoglycerate dehydrogenase is implicated in the patho- 
genesis of d- or L-hydroxyglutaric aciduria [11]. The results 
indicate also that 3-phosphoglycerate dehydrogenase is not 
responsible for the L-2-hydroxyglutarate dehydrogenase activity 
found in rat liver [32]. 

Conclusion 

This paper describes the cloning of the cDNA encoding rat liver 
3-phosphoglycerate dehydrogenase and the confirmation of its 
identity by expression of the encoded protein in bacteria. The 
availability of this clone now opens up the possibility of studying 
the molecular basis of human 3-phosphoglycerate dehydrogenase 
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deficiency and the mechanism by which protein intake regulates 
the expression of this enzyme in the liver. 
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